SEQUENCE LISTING 

( 1 ) GENERAL INFORMATION: 

( i i i ) NUMBER OP SEQUENCES: 27 

( 2 ) INFORMATION FOR SEQ ID NO:l: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 5943 baas pnri 
( B ) TYPE: aoclac add 
( C ) STRANDEDNBSS: doofak 
( D )TOPOLOGY: bar 

( i i ) MOLECULE TYPE: DNA (jn i ink ) 

( t i i ) IMMEDIATE SOURCE.- 

( B ) CLONE: Bam* Ffanoan A-ah*a 

( i z ) FEATURE: 

( A ) NAME/KEY: CDS 

( B ) LOCATION: ya*0UK 1134-1279. 1739.1922.3035-3200. 
37*6-3210) 

( z i )SBQUENCE DESCRIPTION: SBQ ID NOtl: 
OTCTAOOAOC CAOCCCCACC CTTAOAAAAO ATO TTT TCC ATO AGO AT C OTC TOC 34 

Mat Pbc Sei Mat Arg I 1 • Val Cyi 

1 5 

CTA OTT CTA AOT OTO OTO OOC ACA OCA TOO OTATOOCCCT TTTCATTTTT 104 
Lao Val L ♦ n Sar Val Val Oly Tbr Ala Trp 
10 13 

TCTTCTTOCT TTCTCTCTOO TOTTTATTCC ACAAAOAOCC TOOAOOTCAO AOTCTACCTO 164 

CTCTATOTCC TOACACACTC TTAOCTTTAT OACCCCAOOC CTOOOAOOAA ATTTCCTOOO 224 



8 3| 
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-continued 

TOOOCTTOAC ACCTCAAOAA TACAOOOTAA TATOACACCA AOAOOAAOAT CTTAOATOOA 2»4 

TOAOAOTOTA CAACTACAAO OOAAACTTTA OCATCTOTCA TTC AOTCTTA CCACATTTTO 344 

TTTTOTTTTO TTTTAAAAAO OOCAAOAATT ATTTOCCATC CTTOT ACCT A TAAAOCCTTO 404 

OTOCATTATA ATOCTAOTTA ATOOAATAAA ACATTTTATO OTAAOATTTO TTTTCTTTAO 464 

TTATTAATTT CTTOCTACTT OTCCATAATA AOCAO AACTT TTAOTOTTAO TACAOTTTTO 324 

CTOAAAOOT T ATTOTTOTOT TTOTCAAOAC AOAAOAAAAA OCAAACOAAT TATCTTTOOA 3 64 

A AT ATCTTTO C AO TAT C AO A AOAOATTAOT TAOTAAOOCA AT ACOCT TTT CCOCAOTAAT 644 

OOTATTCTTT TAAATTATOA ATCCATCTCT AAAOOTTACA TAOAAACTTO AAOOAOAOAO 704 

OAACATTCAO TTAAOATAOT CTAOOTTTTT CTACTOAAOC AOCAATTACA OOAOAAAOAO 764 

CTCTACAOTA OTTTTCAACT TTCTOTCTOC AOTCATTAOT AAAAATOAAA AOOTAAAATT S24 

TAACTOATTT TATAOATTCA AATAATTTTC CTTTTAOOAT OOATTCTTTA AAACTCCTAA 114 

TATTTATCAA ATOCTTATTT AAOTOTCACA CACAOTTAAO AAATTTOTAC ACCTTOTCTC 944 

CTTTAATTCT CATAACAACT CCATAAAATO OOTCCTAOOA TTTCCATTTO AAOATAAOAA 1004 

ACCTOAAOCT TOCCOAAOCC CTOTOTCTOC TCTCCTTAAT CTCTOTOAOA OTOCCATCTC 1064 

TTCCTOOOOA CTTOTAOOCA TOCCACTOTC TCCTCTTCTO OCTAACATTO CTOTTOCTCT 1124 

CTTTTOTOTA TOTOAATOAA TCTTT A A AO ACT OCA OAT AOT OOT OA A OOT OAC 1177 

Tbr Ala A • p Sar Oly Ola Oly Alp 

2 0 2 3 

TTT CT A OCT OA A OOA OOA OOC OTO COT OOC CCA AOO OTT OTO OA A AO A 1223 

Pbc Lao Ala Ola Oly Oly Oly Val Arg Oly Pro Arg Val Vil Ola Arg 
3 0 3 5 40 

CAT CAA TCT OCC TOC AAA OAT TCA OAC TOO CCC TTC TOC T CT OAT O A A 1273 

His Ola Sar Ala C y • L y • Asp Sat Atp Trp Pro Pba Cyt Sar Atp Olo 
4 5 5 0 3 5 

OAC TOO OTAAOCAOTC AOCOOOOOAA OCAOOAOATT CCTTCCCTCT OATOCTAOAO 13 29 

Asp Trp 
6 0 

OOOCTCACAO OCTOACCTOA TTOOTCCCAO AAACTTTTTT AAATAOAAAA TAATTOAATA 13 19 

OTTACCTACA TAOCAAATAA AOAAAAOOAA CCTACTCCCA AOAOCACTOT TTATTTACCT 1449. 

CCCCAACTCT OOATCATTAO TOOOTOAACA OACAOOATTT CAOTTOCATO CTCAOOCAAA 1509 

ACCAOOCTCC TOAOTATTOT OOCCTCAATT TCCTOOCACC TATTTATOOC TAAOTOOACC 1569 

CTCATTCCAO AOTTTCTCTO COACCTCTAA CTAOTCCTCT TACCTACTTT TAAOCCAACT 1629 

TATCTOOAAO AOAAAOOOTA OOAAOAAATO OOOOCTOCAT OOAAACATOC AAAATTATTC 16S9 

TOAATCTOAO AOATAOATCC TTACTOTAAT TTTCT CCCTT CACTTTCAO AAC TAC 1744 

A t a T y r 

AAA TOC CCT TCT OOC TOC AOO ATO AAA OOO TTO ATT OAT OAA OTC A AT 1792 

Lys Cys Pro Sar Oly Cys Arg Mat Lys Oly Laa 11a Atp Olo Val Asa 
6 5 7 0 7 5 

CAA OAT TTT ACA AAC AO A ATA A A T A AO CTC AAA A AT TCA CT A TTT OAA 1140 

Ola A • p Pba Tbr An Arg lla Asa Lyt Lao Lys Asa Sar Lao Pba Olo 
SO 15 9 0 

TAT CAO A AO AAC A A T A A O OAT TCT CAT TCO TTO ACC ACT A AT ATA ATO llll 

Tyr Ola Lys Asa Asa Lys Asp Sar Bis Sar Lao Tbr Tbr Asa lla Mat 
* 3 1 00 1 03 1 1 0 

OAA ATT TTO AO A OOC OAT TTT TCC TCA OCC A AT A OTAAOTATTA 1932 
Ola lla Lao Arg Oly Asp Pba Sar Sar Ala Asa 

113 120 



CATATTTACT TCTTTOACTT TATAACAOAA ACAACAAAAA TCCTAAATAA ATATOATATC 



1992 
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-continued 

COCTT AT ATC TATOAC AATT TCATCCCAAA OTACTTAOTO TAOAAACACA TACCTTCATA 205 2 

ATATCCCTOA AAATTTTAAO AOOOAOCTTT TOTTTTCOTT ATTTTTTCAA AO TAAA AO AT 3 112 

OTTAACTOAO ATTOTTT AAO OTCACAAAAT AAOTCAOAAT TT TOO ATT A A AACAAOAATT 2 173 

TAAATOTOTT CTTTTC AACA OTATATACTO AAAOT AOOAT OOOTCAOACT CTTTOAOTTO 233 3 

ATATTTTTOT TTCTOCTTTO TAAAOOTOAA AACTOAOAOO TC A AOOAACT TOTTCAAAOA 3 3 93 

CACAOAOCTO OOAATTCAAC TCCCAOACTC CACTOAOCTO ATTAOOTAOA TTTTTAAATT 335 3 

TAAAATATAO OOTCAAOCTA COTCATTCTC ACAOTCTACT CATTAOOOTT AOOAAACATT 34 13 

OCATTCACTC TOOOCATOOA CAOCOAOTCT AOOOAOTCCT CAOTTTCTCA AOTTTTOCTT 3473 

TOCCTTTTTA CACCTTCACA AACACTTOAC ATTT AAAATC AOTOATOCCA ACACTAOCTO 3333 

OCAAOTOAOT OATCCTOTTO ACCCAAAACA OCTTAOOAAC CATTTCAAAT CTAT AOAOTT 3 3 93 

AAAAAOAAAA OCTCATCAOT A AO AAA AT CC AATATOTTCA AOTCCCTTOA TTAAOOATOT 363 3 

TATAAAATAA TTOAAATOCA ATCAAACCAA CTATTTTAAC TCC AAATTAC ACCTTTAAAA 37 13 

TTCCAAAOAA AOTTCT TCTT CTATATTTCT TTOOO ATT AC TAATTOCTAT TAOOACATCT 3 7 73 

TAACTOOCAT TCATOOAAOO CTOCAOOOCA TAACATTATC CAAAAOTCAA ATOCCCCATA 2133 

OOTTTTOAAC TCACAOATTA AACTOTAACC AAAATAAAAT TAOOCATATT TACAAOCTAO 2192 

TTTCTTTCTT TCTTTTTTCT CTTTCTTTCT TTCTTTCTTT CTTTCTTTCT TTCTTTCTTT 2952 

CTTT CTTTCT TTCTCCTTCC TTCCTTTCTT CCTTTCTTTT TTOCTOOCAA TTACAOACAA 30 12 

ATCACTCAOC AOCTACTTCA ATAACCATAT T-TT COATTTC AO AC COT OAT AAT 3063 

A a a Arf Aip A • a 

123 

ACC TAC AAC COA OTO TC A OAO OAT CTO AO A AOC AO A ATT OAA OTC CTO 3 113 

Tbr Tyi Asa Arg Val Sat Ola Aip Laa Art Sir Arg II* Ola Val Lid 
13 0 13 5 140 

AAO COC AAA OTC ATA OAA AAA OTA CAO CAT ATC CAO CTT CTO CAO AAA 3 16 1 

Ly# Arg Lyi Val lie Olo Lyr Val Ola Bis Ita Ola Lam Lao Ola Lyt 
14 5 15 0 15 5 

AAT OTT AO A OCT CAO TTO OTT OAT ATO AAA COA CTO OAO OTAAOTATOT 32 10 

A • a Val Arg Ala Ola Lis Val Atp Mat Lyt Arg Lao Ola 

160 165 170 

OOCTOTOOTC CCOAOTOTCC TTOTTTTTO A OTAOAOOOAA AAOOAAOOCO AT AOTT ATOC 3 270 

ACTOAOTOTC TACTATATOC AOAOAAAAOT OTTATATCCA TCATCTACCT AAAAOTAOOT 3330 

ATTATTTTCC TCACTCCACA OTTOAAOAAA AAAAAATTCA O AOAT ATTAA OTAAATTTTC 3 390 

CAACOTACAT AOATAOTAAT TCAAAOCAAT OTTCAOTCCC TOTCT ATTCC AAOCCATTAC 3450 

ATCACCACAC CTCTOAOCCC TCAOCCTOAO TTCACCAAOO ATCATTTAAT TAOCOTTTCC 35 10 

TTTOAOAOOO AATAOCACCT TACTCTTOAT CCATTCTOAO OCTAAOATOA ATTAAACAOC 3570 

AT CC ATTOCT .TATCCTOOCI AOCCCTOCAA TACCCAACAT CTCTTCCACT OAOOOTOCTC 3 6 3 0 

OATAOOCAOA AAACAOAOAA TATTAAOTOO TAOOTCTCCO AOTCAAAAAA AATOAAACCA 3690 

OTTTCCAOAA OOAAAATTAA CTACCAOOAA CTCAATAOAC OTAOTTTATO TATTTOT ATC 3750 

T AC ATTTTCT CTTTATTTTT CTCCCCTCTC TCTAO OTO OAC ATT OAT ATT AAO 3103 

Val Aap 11a Aap 11a Lyt 

1 75 

ATC COA TCT TOT COA OOO TC A TOC AOT AOO OCT TT A OCT COT OAA OTA 3 35 1 

11a Arg Sar Cya Arg Oly Sar Cyt Sar Arg Ala La* Ala Arg Ola Val 
1*0 i g 5 190 

OAT CTO AAO OAC TAT OAA OAT CAO CAO AAO CAA CTT OAA CAO OTC ATT. 3 399 




Alp Lev L 7 • 
1 95 

OCC AAA OAC 

Ala L 7 • Aip 
2 1 0 

AAA AT O AAA 

Lyi M • l L y ■ 

225 

CTT CAO AAO 

Leo Ola Lyi 



ATO A O A ATO 
Me t A r f Met 



OOC TCC ACC 

O 1 7 Set Thr 
2 7 5 

CCT AOC AOT 

Pro Set Set 
2 9 0 

ACT OOA AAC 

Tb r O 1 y Aia 

3 0 5 

TOO AAA CCT 

Trp Ly i Pre 



OOO AOC TCT 
Oly Ser Scf 



AO A CCT OOT 

Ai| Pro O 1 7 
3 5 5 

AOT OCT OOO 

Ser Ala O 1 7 
3 7 0 

CAA TOO CAC 

Ola Trp B i • 
3 8 5 

000 AAC OCO 

0 1 7 A 1 a Ala 



T C A OOA A AT 
Ser OI7 Aia 



CTO OTC ACT 

Lea Vil Tb r 
435 

OTC ACC TCT 

Val Tbr Set 
4 5 0 

OTT ACT AAO 

Va 1 Tbr Ly 1 
465 

OA A OTO OTO 

Ola Vil V* 1 



TTA OOC ACA 
Les Oly Tbr 



CAC CCT OAT 



Aip Tyr Olo 



TTA CTT CCC 
Leo Las Pre 



CCA OTT CCA 

Pro Val Pro 
2 3 0 

OTA CCC CCA 

Val Pro Pro 
2 45 

OAO TTA OAO 

O 1 o Lea Ola 
260 

TCT TAT OOA 

Ser Tyr Oly 



OCT OOA AOC 
Ala O 1 7 Ser 



COA AAC CCT 

Ar( All Pro 

3 1 0 

OOO AOC TCT 

Oly Ser Ser 

3 2 5 

OOA ACT OOA 

Oly Tbr Oly 
3 4 0 

AOT ACC OOA 

Ser Tbr O 1 7 



CAC TOO ACC 
Bit Trp Tbr 



TCT OA A TCT 

Ser Ol o Ser 
3 9 0 

AOO CCT AAC 

At 1 Pro All 
405 

OTA AOT CCA 

V • 1 Ser Pro 
4 2 0 

TCT AAA OOA 

Ser Lyi Oly 



OOT AOC ACA 
Oly Ser Tbr 



ACT OTT ATT 

Tbr Val lie 
4 7 0 

ACC TCC OA A 

Tbr Ser Ola 
4tS 

TTO TCT OOC 

Leo Ser Oly 
5 0 0 

OAA OCT OCC 



A 1 p Ola Ola 

20 0 

TCT AO A OAT 

Ser Ai| Aip 

2 I 5 

OAC TTO OTT 

Aip Leo Val 



OAO TOO AAO 
Ola Trp Lyi 



AO A CCT OOT 

Arg Pro OI7 
265 

ACC OOA TCA 

Tbr Oly Ser 
2 S 0 

TOO AAC TCT 

Trp An Sar 

2 9 5 

OOO AOC TCT 

Oly Ser Ser 



OOA CCT OOA 
O 1 7 Pro O 1 7 



AOT ACT OOA 

Ser Tbr O 1 7 
3 4 5 

ACC TOO A AT 

Tbr Trp A 1 a 
3 6 0 

TCT OAO AOC 

Ser Ola Ser 
3 7 5 

OOA AOT TTT 

Oly Ser Pbe 



AAC CCA OAC 
Aii Pro Aip 



OOO ACA AOO 

Oly Tbr Arg 
4 2 5 

OAT AAA OAO 

Aip L71 Ola 
4 4 0 

ACC ACC ACO 

Tbr Tbr Tbr 
4 5 5 

OOT CCT OAT 

OI7 Pro Aip 



OAT OOT TCT 
Aip Oly Ser 



ATA OOT ACT 
lie Oly Tbr 
505 

TTC TTC OAC 



L71 Ola Leo 



AOO CAA CAC 

Arg Ola Bit 
220 

CCC OOA AAT 

Pro Oly A • a 
235 

OCA TTA ACA 

Ala Leo Tbr 

2 5 0 

OOA AAT OAO 

Oly A • a Ol o 



OAO ACO OAA 
O 1 o Tbr Ol o 



OOO AOC TCT 

OI7 Ser Ser 
3 00 

OOO ACT OOA 

O 1 7 Tbr Oly 
315 

AOT OCT OOA 

Ser Ala Oly 
3 3 0 

AAC CAA AAC 

Aia Ola Aia 



CCT OOC AOC 
Pro Oly Ser 



TCT OTA TCT 
Ser V 1 I Ser 
3 8 0 

AOO CCA OAT 
Arg Pro Aip 
3 95 

TOO OOC ACA 
Trp Oly Tbr 
4 1 0 

A O A OAO- TAC 
Arg Olo Tyr 



CTC AOO ACT 
Lea Arg Tbr 



COT COT TCA 

Arg Arg Ser 
4 6 0 

OOT CAC AAA 

O 1 7 Bit Lyi 
475 

OAC TOT CCC 

Aip Cy • Pro 
4 90 

CTO OAT OOO 

Leo Aip Oly 



ACT OCC TCA 



Olo Ola Val 

205 

TTA CCA CTO 

Leo Pro Lao 



TTT AAO AOC 
Pbe Lye Sar 



OAC ATO CCO 

Aip Mot Pro 

25 5 

ATT ACT COA 

lie Tbr Arg 

2 70 

AOC CCC AOO 

Ser Pro Arg 
285 

OOA CCT OOA 

Oly Pro Oly 



OOO ACT OCA 
Oly Tbr Ala 



AOC TOO AAC 

Ser Trp Aia 
3 3 5 

CCT OOO AOC 

Pro Oly Ser 
3 5 0 

TCT OAA COC 

Sar Olo Arg 
3 65 

OOT AOT ACT 

Oly Ser Tbr 



AOC CCA OOC 
Ser Pro Oly 



TTT OAA OAO 

Pbe Olo Olo 
4 1 5 

CAC ACA OAA 

Bit Tbr Olo 
4 3 0 

OOT AAA OAO 

Oly Lyi Olo 
445 

TOC TCT AAA 

C y • Sar Lyi 



OAA OTT ACC 
Olo Val Tbr 



OAO OCA ATO 

Olo Ala Ma t 
495 

TTC COC CAT 

Pbe Arg Bl • 
5 1 0 

ACT OOA AAA 



I 1 e 



ATA 3947 
I 1 a 



CAO 3 9 95 

0 1 a 
2 40 

CAO 4 04 3 

01 a 



OOA 4091 

Oly 



AAC 4 139 

Aia 



AOT 4 187 

Sar 



ACC 4235 

Tbr 

3 20 

TCT 4283 
Sar 



CCT 4331 
Pro 



OOA 4379 
Oly 



OOA 4427 
Oly 



TCT 4475 

Sar 

4 0 0 

OTO 4523 
Va 1 



AAA 45 7 1 

Lyi 



AAO 46 19 

Ly . 



ACC 4667 
Tb r 



AAA 47 15 

Lyi 

4 SO 

OAT 47 6 3 

Aip 



AOO 4 • 1 1 

Ar g 



ACA 4859 



25 26 

-continued 

Bit Pte Asp Oil Ala Ala Pba Pi« Atp Tfcf Ala Sar Tkr Oly Lyt Til 

5 15 520 525 

TTC CCA OOT TTC TTC TCA CCT ATO TTA OOA OAO TTT OTC AOT OAO ACT 4907 
Pb« Pro Oly Fki Pa* Sat Pro Mat Las Oly Oil Pka Val Sar Ola Tbr 
530 535 540 

OAO TCT AOO OOC TCA OA A TCT OOC ATC TTC ACA AAT ACA AAO OA A TCC 4955 
Ols Sar Aft Oly Sar Ola Sat Oly 111 Pba Tax Ata Tit Lyt Ola Sar 
545 550 335 560 

AOT TCT CAT CAC CCT OOO ATA OCT OA A TTC CCT TCC COT OOT AAA TCT 5003 
Sar Sar Bit HI i Pro Oly III Ala Ola Paa Pro Sar Art Oly Lys Sar 
565 570 575 

TCA AOT TAC AOC AAA CAA TTT ACT AOT AOC ACO AOT TAC AAC AOA OOA 505 1 

Sar Sar Tyr Sar Lyt Ola Pba Tar Sar Sar Tar Sar Tyr Ata Art Oly 
3 tO 5 35 590 

OAC TCC ACA TTT OA A AOC AAO AOC TAT AAA ATO OCA OAT OAO OCC OOA 5099 
Atp Sar Tbr Pba Ola Sar Lyt Sar Tyr Lyt Mat Ala Atp Ola Ala Oly 
595 600 605 

AOT OA A OCC OAT CAT OA A OOA ACA CAT AOC ACC AAO AOA OOC CAT OCT 5 147 

Sar Ola Ala Atp Bit Ola Oly Tbr Bit Sar Tbr Lyt Art Oly Hit Ala 
6 10 6 15 6 20 

5=| AAA TCT COC CCT OTC AOA OOT ATC CAC ACT TCT CCT TTO OOO AAO CCT 5195 

Lyt Str Arg Pro Val Art Oly III Bat Tbr Sar Pro Laa Oly Lyt Pro 

kU 6 2 5 630 635 640 

ril 

jj^ TCC CTO TCC CCC TAOACTAAOT TAAATATTTC TOCACAOTOT TCCCATOOCC 5247 

I-sJ Sar Lao Sar Pro 

J£ CCTTOCATTT CCTTCTTAAC TCT CTOTT A C ACOTCATTOA AACTACACTT TTTTOOTCTO 5307 

IB TTTTTOTOCT AOACTOT AAO TTCCTTOOOO OCAOOOCCTT TOTCTOTCTC ATCTCTOT AT 5 36 7 

TO TCCCAAATOC CTAACAOTAC AOAOCCATOA CTCAATAAAT AC ATOTT AAA TOOATOAATO 5 427 

a 

i™. AATTCCTCTO AAACTCTATT TOAOCTTATT TAOTCAAATT CTTTCACTAT TCAA AOTOTO 54S7 

a 2 

1=^ TOCTATTAOA ATTOTCACCC AACTOATTAA TCACATTTTT AOT ATOTOTC TCAOTTOACA 5 547 

TTTAOOTCAO OCTAAATACA AOTTOTOTTA OTATTAAOTO AOCTTAOCTA CCTOTACTOO 5607 

TTACTTOCTA TT AOTT TO TO CAAOTAAAAT TCCAAATACA T TTO AOO AAA ATCCCCTTTO 5 667 

CAATTTOTAO OTATAAATAA CCOCT T AT TT OCATAAOTTC TATCCCACTO TAAOTOCATC 5727 

CTTTCCCT AT OOAOOOAAOO AAAOOAOOAA OAAAOAAAOO A AOOO AA AO A AAC AOT ATT T 5 7S7 

OCCTTATTTA ATCTOAOCCO TO CCT AT CT T TOTAAAOTTA A ATO AO A AT A ACTTCTTCCA 5347 

ACCAOCTTAA TTTTTTTTTT AOACTOTOAT OATOTCCTCC AAACACATCC TTCAOOTACC 5907 

CAAAOTOOCA TTTTCAATAT CAAOCTATCC OOATCC 5943 

( 2 ) INFORMATION FOR SBQ ID N02: 

( i )SfiQUENCECBA]UCTeRISTICS: 

( A ) LENOTB: 644 anas aadf 
(B ) TYPE: anao at* 
( D ) TOPOLOGY: Eaaar 

( i i ) MOLECULE TYPE: praaas 

( i i ) SEQUENCE DESCRIPTION: SBQ ID NOS: 

Mat Ph« Sar Mt t Art 11* v *» Cyt Lao Val Laa Sar Val Val Oly Tbr 

1 5 10 15 

Ala Trp Tbr Ala Atp Sar Oly Ola Oly Atp Pba Laa Ala Ola Oly Oly 

2 0 2 5 3 0 

Oly Val Art Oly Pro Art v *l Val Ola Art Hit Ola Sar Ala Cyt Lyt 
35 40 45 



m 



27 



28 



Asp Ser Asp Tip Pre 
50 

Pro Ser Oly Cy s At| 

65 

Pb • Tbr Asa Art II* 
I 5 

L y • Asa Asa Ly i Asp 

1 0 0 

Leo At | Oly Asp Pb c 

I 1 5 

Arg Vil Ser Olo Asp 

1 3 0 

Vil lie Olo Lys Val 

1 4 5 

Ala Ola Lao Val Asp 

1 6 5 

lie Arg Sar Cy • Arg 

1 S 0 

Aip Leo L 7 i Asp Tyr 

1 95 

Ala Lys Asp Leo Leo 

2 1 0 

Lys Met Lys Pro Val 

2 2 5 

Leo Ola Lys Val Pro 

2 4 5 

Met Arg Mat Olo Leo 

2 6 0 

Oly Ser Tor Ser Tyr 

2 7 5 

Pro Ser Ser Ala Oly 

2 90 

Tbr Oly Asa Arg Asa 

5 0 5 

Trp Lys Pro Oly Sar 

3 2 5 

Oly Ser Ser Oly Tbr 

3 4 0 

Arg Pro Oly Ser Tbr 

3 5 5 

Ser Ala Oly Bis Trp 

3 70 

Ola Trp Bit Ser Olo 

3 15 

Oly Asa Ala Arg Pro 
40 3 

Ser Oly Asa Val Ser 
420 

Leo Val Tbr Ser Lys 

4 3 5 

Val Tbr Ser Oly Set 

4 5 0 

Val Tbr Lys Tbr Val 

4 6 5 



Pba Cys Sar 
5 5 

Met Lys Oly 

7 0 

Asa Lys Leo 



Ser Bis Ser 



Ser Ser Ala 

12 0 

Leo Arg Ser 

1 3 5 

Ola Bis lie 

1 5 0 

Me t Lys Arg 



Oly Ser Cys 

Olo Asp Ola 

2 0 0 

Pro Ser Arg 

2 1 5 

Pro Asp Leo 

2 3 0 

Pro Olo Trp 

Olo Arg Pro 



Oly Tbr Oly 
2 10 

Ser Trp Asa 

2 9 5 

Pro Oly Ser 
3 10 

Ser Oly Pro 



Oly Ser Tbr 



Oly Tbr Trp 
3 6 0 

Tbr Ser Olo 

3 7 5 

Ser Oly Ser 

3 9 0 

Asa Asa Pro 



Pro Oly Tbr 



Oly Asp Lys 
4 4 0 

Tbr Tbr Tbr 
4 5 5 

Me Oly Pro 
4 7 0 



-continued 

Asp Olo Asp 

Lao lie Asp 

75 

Lys Asa Sar 

90 

Lao Tbr Tbr 

10 5 

Asa Asa Arg 

Arg lie Olo 



Ola Leo Lao 

1 5 5 

Lao Olo Val 
1 70 

Sar Arg Ala 
1 t 5 



Trp Asa 

60 

Olo Val 



Lao Pba 



Asa lie 



Asp 



Va 1 
1 4 0 



Asp Arg Ola 



Val Pro Oly 
2 3 5 

Lys Ala Leo 

2 5 0 

Oly Oly Asa 

2 6 5 



Ser Olo Tbr Olo 



Ser Oly Ser 



Ser Oly Tbr 
3 1 5 

Oly Ser Ala 

3 3 0 

Oly Asa Ola 
3 4 3 



Ser 
3 0 0 

Oly 
Oly 



Asa Pro Oly Ser 



Ser Ser Val 

Pbe Arg Pro 

3 95 

Asp Trp Oly 
4 1 0 

Arg Arg Olo 
4 2 5 

Olo Leo Arg 

Tbr Arg Arg 



Asp Oly Bis 
4 7 5 



S e r 
3 t 0 

Asp 

Tb r 
Tyr 
Tbr 



Ser 
4 6 0 



Asa 

1 2 5 



Leo 



Tyt Ly 

Asa Ol 

Olo Ty 
9 

Mat 0 1 

1 1 0 

Tb t Ty 

Lys Ar 



Ola Lys Asa Va 



Asp tie 



Leo Ala 



Ola Lys Ola Leo 



B i s 
2 2 0 



O 1 o 
205 



Lao 



Asp 11 
1 7 

Arg Ol 

1 9 0 

Ola Va 
Pro La 



Asa Pba Lys Sa 



Tbr Asp 



Olo tie 



Sar 
2 t 5 

Oly 
Oly 
Sar 



Asa Pro 



Sar 

3 6 5 



Met P r 

2 5 



Tb r Ar 
2 7 0 



Pro Ar 



Pre O 1 



Tbr A 1 



Trp As 

3 3 

Oly S a 
3 5 0 

Olo At 



Oly Sar Tb 



Sar Pro Ol 



Pbe 

Bi s 



Oly 
445 

Cys 



Olo O 1 
4 1 

Tbr Ol 
4 3 0 

Lys O I 
Ser Ly 



Lys Olo Val Tb 



Cy • 



Asp 

• 0 

Ot a 



I 1 a 
Asa 
Ly s 

Arg 

16 0 

Ly s 
Va 1 
I 1 e 
I 1 a 

01 a 

2 40 

Ol a 
Oly 
Asa 
Sar 

Tbr 

3 2 0 

Sar 

Pro 
Oly 
Oly 

Sar 

4 0 0 

Va 1 
Ly s 
Lys 
Tb r 



Lys 
4 • o 



29 30 

-continued 



Ola Vil Val Tbr Ser Ola Aip Oly Ser Aap Cye Pre Ola Ala Mat Aap 
4S5 490 495 

Lea Oly Tar Lai Sar Oly lie Oly Tar Lea Atp Oly Pbe Art Bit Ar g 

5 00 505 5 1 0 

Bit Pro Aap Ola Ala Ala Pbe Pbe Aap Tar Ala Sar Tar Oly Lye Tbr 

515 320 525 

Pbe Pro Oly Pbe Pbe Ser Pro Mat Lea Oly Ola Pbe Val Sir Ola Tbr 

530 535 540 

Ola Ser Aig Oly Ser Ols Ser Oly lie Pbe Tbr Abb Tbr Lye Ola Ser 
545 550 555 560 

Ser Ser Bit Bii Pro Oly lie Ala Ola Pbe Pro Ser Arg Oly Lye Sar 

565 570 575 

Ser Ser Tyr Sar Lji Ola Pbe Tbr Ser Ser Tbr Sar Tyr Asa Arg Oly 
510 5S5 590 

Aap Ser Tbr Pbe Ola Ser Lya Ser Tyr Lya Met Ala Aep Ola Ala Oly 

595 600 605 

Ser Ola Ala Aap Bia Ola Oly Tbr Bis Ser Tbr Lya Arg Oly Bie Ala 

6 10 6 15 6 2 0 

Lya Ser Arg Pro Val Arg Oly lie Bia Tbr Ser Pro Laa Oly Lya Pre 

625 630 635 640 

Ser Lea Ser Pro 

( 2 ) INFORMATION FOR SBQ ZD N03: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH; SS71 bee p» 
( B ) TYPE: aackfc and 
( C ) STRANDEZ3NESS: doable 
(D)TOPCtjOOY:&aaer 

( i i ) MOLECULE TYPE: DNA (fcaonic) 

< * i i ) IMMEDIATE SOURCE: 
( B ) CLONE: tin 

( i x ) FEATURE: 

( A ) NAME/KEY: aatacJtNA 
( B ) LOCATION: U469 

( i x ) FEATURE: 

( A ) NAME/KEY: cxee 
( B ) LOCATION: 4XL5I3 

( ■ x ) FEATURE: 

( A ) NAME/KEY: iara 
( B ) LOCATION: 5*0257 

( i x ) FEATURE: 

( A ) NAME/KEY: axoa 
(B ) LOCATION: 32513449 

( i x ) FEATURE: 

( A ) NAMB/XEY: nana 
( B ) LOCATION: 3450J93S 

( i x ) FEATURE: 

( A ) NAME/KEY: exoa 
( B ) LOCATION: 3939.4122 

( i x ) FEATURE: 

( A ) NAME/KEY: bana 
( B ) LOCATION: 4123.5042 

( 1 x ) FEATURE: 

( A ) NAMB/XEY: exoa 
( B ) LOCATION: 500-5270 



( i x ) FEATURE: 
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( A ) NAME/KEY: asm 
( B ) LOCATION; 5271-3S30 

( i x ) FEATURE: 

< A ) NAME/KEY: «xs* 

( B ) LOCATION: 5131-5944 

( i i ) FEATURE: 

( A ) NAME/KEY: an 
( B ) LOCATION: 5945j6£32 

( i i ) FEATURE: 

( A ) NAME/KEY: mm 

( B ) LOCATION: 663Ju675» 

( i i ) FEATURE: 

( A ) NAME/KEY: mm 
( B ) LOCATION: 6759-6966 

( i x ) FEATURE: 

( A ) NAME/KEY: am 

( B ) LOCATION: 6967-7252 

( i x ) FEATURE: 

( A) NAME/KEY: mm 
( B ) LOCATION: 7251-7110 

< i x ) FEATURE: 

( A ) NAME/KEY: mm 

( B ) LOCATION: 7171-S1G2 

( i x ) FEATURE: 

( A ) NAME/KEY: 3*UTR 
( B ) LOCATION: S103_SS37 



=J£ (ii ) FEATURE: 

,U ( A ) NAME/KEY: nbc_RNA 

^ ( B ) LOCATION: «53&_tt7* 

( i x ) FEATURE: 

( A ) NAME/KEY: CDS 

p ( B ) LOCATION: jM*470-5l3. 325U449. 3939.4122, 

' ™" 5O43_5270, 5C31-5944. 663X6751, 6967.725Z 

7S71U102) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID N03: 

OAATTCATOC CCCTTTTOAA ATAOACTTAT OTCATTOTCA OAAAACATAA OCATTTATOO 60 

TATATCATTA ATOAOTCACO ATTTTAOTOO TTOCCTTOTO AOTAOOTCAA ATTTACTAAO 120 

CTTAOATTTO TTTTCTCACA TATTCTTT CO OAOCTTOTOT AOTTTCCACA TTAATTTACC 110 

AOAAACAAOA TACACACTCT CTTTOAOOAO TOCCCTAACT TCCCATCATT TTOTCCAATT 240 

AAATOAATTO AAOAAATTTA ATOTTTCTAA ACTAOACCAA CAAAOAATAA TAOTTOTATO 300 

ACAAOTAAAT AAOCTTTOCT OOOAAOATOT TOCTTAAATO ATAAAATOOT TCAOCCAACA 360 

AOTOAACCAA AAATTAAATA TTAACTAAOO AAAOOTAACC ATTTCTOAAO TCATTCCTAO 420 

CAOAOOACTC AOATATATAT AOOATTOAAO ATCTCTCAOT T A AOTCT AC ATO AAA 473 

Mm t Lyi 

I 

AOO ATO OTT TCT TOO AOC TTC CAC AAA CTT AAA ACC ATO AAA CAT CT A 323 
Art M«t V«l S«r Tip S«r Pat His Ly i L • u L j $ Thr Mat Ly» Hat L«* 
* 10 13 

TTA T TO CTA CT A TTO TOT OTT TTT CT A OTT A AO TCC CAA OOT OTC AAC 37 1 

L«o L.n L • q L«o L«b Cyt V»l Pbt L • o Vil L y • Smt Oil Oly V*l Alt 

20 2 3 3 0 

OAC A A T OAO OAO OTOAATTTTT TAAAOCATTA TT AT ATT ATT AOTAOTATTA 623 
Alp A • a O 1 u Ola 

3 5 



TTAATATAAO ATOTAACATA ATCATATTAT OTOCTTATTT TAATOAAATT AOCATTOCTT 



613 
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ATAOTTATOA AATOOAATTO TT AACCTCTO ACTT ATTOT A TTTAAAOAAT OTTTCATAOT 

ATTTCTTATA TAAAAACAAA OTAATTTCTT OTTTTCTAOT TTATCACCTT TOTTTTCTTA 

AOATOAOOAT OOCTTAOCTA ATOTAAOATO TOTTTTTCTC ACTTOCTATT CTOAOTACTO 

TOATTTTCAT TTACTTCTAO CAATACAOOA TTACAATTAA OAOOACAAOA TCTOAAAATC 

TCACAAACTA TAAAATAATA AAAOAOCAOA ATTTTAAOAT A AAAO AAACT OOTOOTAOOT 

AOATTOTTCT TTOOTOAAOO AAOOTAATAT AT ATTOTT AC TOAOATTACT ATTTATAAAA 

ATTATAACTA AOCCTAAAAO CAAAATACAT CAAOTOTAAT OAT AOAAAAT OAAAT ATTOC 

TTTTT TCAOA TOAAAAOTTC AAATTAOAOT TAOTOTOTAT TOTT ATT ATT AATAOTTATO 

AAACACOGTT CAOTCTAATT TATTTATTTO T AOAACAOTT TOTCCTCAAC T ATT ATTTTT 

OCTOACTTAT TOCTOTTAAT TTOCAOTTAC TAAAAATACA OAAATOCATT TAOOACAATO 

OATATTTAAO AAATTTAAAT TTTATCATCA AACOTATCAT OOCC AAATTT CTTACATATA 

OCATAOTATC ATT AAACT AO AAATAAOAAT ACACAATAAT ATTTAAATOA AOTOATTCAT 

TTCOOATCAT TATTOAOTTT CAAOOOAACT TOAOTOTTOT ACTTATCAOA CTCTACATOT 

AAOAACATAT AOTTAATCTO OTTOTOTOTO T AAAAACAT A TOOTT AATCT OOTTAAOTCT 

OOTTAATCAT ATTAOOTAAO AAAAATOT AA AOAATOTOTA AO A CO A A ATT TTTOT AAAOT 

ACTCTOCAAA OCACTTTCAC ATTTCTOCTT ATCAACTAAA CCTCACAOAO ATAOTTTAAT 

AOTTTAOOCT TTAAAATOOA TTTTOATTAT TCAACAAOTO OCCTTCATAA TTTCTTT AAO 

TOTTTTTCTT TAAOTATATA CTTTCTTTAA ATATTTTTTA AAATTTCCTT TTCTCTAOTA 

AAOCCAOACC ATCCATOCTA CCTCTCTAOT OOCACTCTOA A AT AAA AAO A AAATAOTTTT 

CTCTOTTATA ATTOTATTTO TAATAAOCAO ATOAATCACA TTTCTTAAAA TTTOTTTT AO 

AOAOOOTAAO CTCTOACTAO OACCATOACT TCAATOTOAA ATATOTATAT ATCCTCCOAA 

TCTTTACATA TTAAOAATOT ATATAOTCAA CTOOTTAAAC AOOAAAATCT OOAACAOCCT 21 

OOCTOOOTTT TAATCTTAOC ACCATCCTAC TAAATOTTAA ATAATATTAT AATCTAATOA 2< 

ATAAATOACA ATOCAATTCC AAATAOAOTT CATCTOATOA CTTCTAOACT CACAAAATTO 2 

CAAOAOAOCT CAOTTOTTOC TCAOTTOTTC CAAATCATOT COTTTOTTAA TTTOTAATTA 

AOCTCCAAAO OATOTATAOC TACTOACAAA AAAAAAAATO AOAATOTAOT T AATCCA AAT 

CAAAACTTTC CTATTOCAAT OCOTATTTTC TOCTTCATTA TCCTTTAATA TAATATTTTA 

AOTTAOCAAO TAATTTTAAT TACAATOCAC AAOCCTTOAO AATTATTTTA AATATAAOAA 

AATCATAATO TTTOATAAAO AAATCATOTA AOAAATTTCA AOATAATOOT TTAACAAATA 

ATTTTOTTOA TAOAAOATAA OACTAAAAOT OAAATTCOAA OTOOAOAOOA CACTTAAACT 

OTAOTACTTO TT ATOTOTO A TTCCAOTAAA AATAOTAATO AOCACTTATT ATTOCCAAOT 

ACTOTTCTOA OOOTACCATA TOCAATAAOT TATTTAATCC TTACAATAAT CTTOTAAOOC 

AOATTCAAAC TATCATTACA CTTATTTTAC AOATOAOAAA ACTOOOOCAC AOATAAAOCA 

ACTTOCCCAA OOTCTCATAO CTOTAAOTCA ACCCTACOOT CAAOACCTAC AAOTAOCCOA 

OCTCCAOAOT ACATTATOAO OOTCAAAOAT TOTCTTATTA CAAATAAATT CCAAOTAOAA 

TCAACCTTTA ATAAOTCTTT AATOTCTCTT AAATATOTTT ATATAOOAOT CTAATCACCA 

ATTCACAAAA ATOAAAOTAO OOAAATOATT AACAATAATC ATAOOAATCT AACAATCCAA 

OTOOCTTOAO AATATTCATT CTTCTTOACA OTATAOATTC TTTACAATTT. COTAAOTTCC 

AATOTATOTT TTAOOAATAT OAOOTCATTA CTATTCATAA TCTOATACAO CTTTATCCT A 

AOOCCTCTCT T T A A AAA C T A CACTOCATCA TAOCTTTTTT OTOCAOTTOO TCTTTCT ACT 
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OTTACTOAAC AOTAAOCAAC CTACAO ATTC ACTATCACCA ACCAOCCAOT TOATOOATCT 3 143 

TAAOCAAATT ATCAAOCTTO TOATAACCTA AATT AT AAAA TOAOOOTOTT OOAATAOTTA 3 20 3 

CATTCCAAAT CTTCTATAAC ACTCTOTATT AT ATTTCTOC CTCATTCCTT OTAO OOT 3260 

Ol 7 

TTC TTC ACT OCC COT OOT CAT COA CCC CTT OAC AAO A AO AO A OA A OAO 3301 

Pbe Ffa* Sor Ala Arg Ol y Bit Arg Pro Leo Aap Lyi Ly» Aig Ola Ola 
40 43 30 33 

•OCT CCC AOC CTO AOO CCT OCC CCA CCO CCC ATC AOT OOA OOT OOC TAT 3336 
Ala Pto Ser Leo Arg Pro Ala Pro Pro Pro II* Ser Ol y Ol y O I y Ty t 
6 0 6 5 7 0 

COO OCT COT CCA OCC AAA OCA OCT OCC ACT CAA AAO AAA OTA OAA AO A 3404 
Arg Ala Arg Pro Ala Ly i Ala Ala Ala Th r Ola Ly I Lyi Vil Olo Arg 
73 SO %5 

AAA OCC CCT OAT OCT OOA OOC TOT CTT CAC OCT OAC CCA OAC CTO 3449 
Ly i Ala Pro Aap Ala Oly Oly Cy i Loo Bis Ala Aip Pro Asp Loo 
9 0 9 5 100 

OTOOOTOCAC TOATOTTTCT TOCAOTOOTO OCTCTCTCAT OCAOAOAAAO CCTOTAOTCA 3 509 

TOOCAOTCTO CTAATOTTTC ACTOACCCAC ATTACCATCA CTOTTATTTT OTTTOTTTAT 3569 

TTTOOAAATA AAATTCAAAA CATAAACATA TTOOOCCTTT OOtTTAOOCT TTCTTTCTTO 3629 

TTTTCTTTOO TCTOOOCCCA AAATTTCAAA TTAOOATATO TOOOTOCCAC CTTTCCATTT 36S9 

OTATTTTOCC ACTOCCTTTO TT T AOTTOOT AAAATTTTCA TAOCCCAATT ATATTTTTTC 3749 

TOOOOTAAOT AATATTTTAA ATCTCTATOA OAOTATOATO ATOACTTTCO AATTTCTOOT 3 009 

CTTACAOAAA ACCAAATAAT AAATTTTTAT OTTOOCTAAT COTATCOCTO AATTTTCCTA 3»69 

TOTOCT ATTT T A A C A A A TO T CCATOACCCA AATCCTTCAT CTAATOCCTO CT ATTTTCTT 3 9 2 9 

TOTTTTTAO OOO OTO T TO TOT CCT ACA OOA TOT CAO TTO CAA OAO OCT 3 977 

Oly Val Leo Cy i Pro Tbr Oly Cy i Ola Loo Ola Olo Ala 

j«& 1 0 3 110 115 

H= TTO CTA CAA CAO OAA AOO CCA ATC AO A A AT AOT OTT OAT OAO TT A A AT 4023 

f gff L«o Leo Ola Ola Olo Arg Pro I 1 • Arg A • a Ser Val Aap Olo Loo Asa 

*-*J 1 2 0 1 2 5 13 0 

„ AAC A A T OTO OAA OCT OTT TCC CAO ACC TCC T CT TCT TCC TTT CAO TAC 4073 

*U Aia Aia Val Olo Ala Val Ser Ola Tbr Sor Sor Sor Sor Pbo Ola Tyr 

135 140 145 

ATO TAT TTO CTO AAA OAC CTO TOO CAA AAO AOO CAO AAO CAA OTA AAA O 4122 
Mat Tyr Lao Lao L y « Aip Loo Trp Ola Ly i Arg Ola Ly i Ola Val Ly i 
15 0 i 3 3 16 0 

OTAOATATCC TTOTOCTTTC CATTCOATTT TCAOCTATAA AATTOOAACC OTTAOACTOC 41 12 

CACOAOAATO CATOOTTOTO AO A AO A TT A A CATTTCTOOO TTAOTOAATA OCATTCATAC 4242 

OCTTTTOOOC ACCTTCCCCT OCAACTTOCC AOATAAOCAC T ATTC AOCTC TTATTCCCAO 4302 

TCTOACATCA OCAAOTOTOA TTTTCTATOA AAAATTCTAC T ATO ACTCCT TATTTTAAOT 4362 

ATACAAOAAA CTTOTOACTC AOAAOATAAT ATTTACAOAO T OOA A A A A A A CCCCTAOCAT 442 2 

TTATAOTTTT AACATTTOAO OTTTTOAATO AOAOAOTT AT CCATAATA T A TTCAATTOTO 44S2 

TTOTOO AT A A TOACACCTAA CCTOTOAATC TTOAOOTCAO A ATOTTOAO T OCTOTTOACT 4542 

TOOTOOTCAO OAAACAOCTA OTOCOTOAOC CTOOCACAOO CATCTCAOTO AOTAOCATAC 4602 

CCACAOTTOO AAATTTTTCA AAOAAATCAA AOOAATCATO ACATCTTATA AATTTCAAOO 4662 

TTCTOCT AT A CTTATOTOAA ATOOATAAAT AAATCAAOCA TATCCACTCT OTAAOATTOA 4722 

ACTTCTCAOA TOOAAOACCC CAATACTOCT TTCTCCTCTT TTCCCTCACC AAAOAAATAA 47i2 

ACAACCTATT TCATTT ATTA CTOOACACAA T CTTT AOCOT ATACCTATOO TAAATTACTA 4S42 
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OTATOOTOOT TAOOATTTAT OTTA ATTTOT AT ATOTCATO COCCAAATCA TTTCCACTAA 4903 

ATATOACTAT ATATCATAAC TOCTTOOTOA TAOCTCAOTO TTTAATAOTT TATTCTCAOA 4963 

AAATCAAAAT TOTATAOTTA AATACATTAO TTTTATOAOO CAAAAATOCT AACT ATTTCT 5 03 3 

ACATAATTTC ATTTTTCCAO AT A AT OAA AAT OTA OTC AAT OAO TAC TCC 507 1 

Asp An Oli Asa Vil Val Asa Olt Tyt Sar 

165 170 

TCA OAA CTO OAA A AO CAC CAA TT A TAT ATA OAT OAO ACT OTO AAT AOC 5 119 

Sor OIb Los Olo Lys Hi i Ola Lao Tyr I I « Asp Ola Tar Val Asa Set 

175 110 165 

AAT ATC CCA ACT AAC C TT COT OTO CTT COT TCA ATC CTO OAA AAC CTO 5167 

Asa lie Pro Thr Aia Loa Arg Val Las Art Sif 11a Lav Ola Asa La* 

190 195 300 305 

A OA AOC AAA ATA CAA A AO TTA OAA TCT OAT OTC TCA OCT CAA ATO OAA 53 15 

Arg Sex Lyt lis Ola Lys Loo Olo Sar Aap Val Sar Ala Ola Mat Ola 

3 10 3 15 330 

TAT TOT COC ACC CCA TOC ACT OTC AOT TOC AAT ATT CCT OTO OTO TCT 5365 

Tyr Cyi Arg Thr Pro Cyi Tar Val Sar Cys Asa Ila Ptc Val Val Sat 

235 330 355 

OOC AAA O OTAACTOATT CATAAACATA TTTTTAOAOA OTTCCAOAAO AACTCACACA 55 30 
O 1 y Lyi 

CCAAAAATAA OAOAACAACA ACAACAACAA AAATOCT AAO TOOATTTTCC CAACAOATCA 53 SO 

TAATOACATT AC AO TAC AT C ATAAAAATAT CCTTAOCCAO TTOTOTTTTO OACTOOCCTO 5440 

OTOCATTTOC TOOTTTTOAT OAOCAOOATO OOOCACAOOT AOTCCCAOOO OTOOCTOATO 5500 

TOTOCATCTO COTACTOOCT TOAACAOATO OCAOAACCAC AOATAOATOT AOAAOTTTCT 5560 

CC AT TT TOT O TOTTCTOOOA OCTCATOOAT ATTCCAOOAC ACAAAAOOTO OAOAAOAOCT 5630 

TTOTTCATCC TCTTAOCAOA TAAACOTCCT CAAAACTOOO TTOOACTTAC TAAAOTAAAA 56S0 

TOAAAATCTA ATATTTOTTA TATTATTTTC AAAOOTCTAT AATAACACAC TCCTTAOTAA 5740 

CTTATOTAAT OTTATTTTAA AOAATTOOTO ACTAAATACA AAOTAATTAT OTCATAAACC 5S00 

CCTOAACATA ATOTTOTCTT ACATTTOCAO AA TOT OAO OAA ATT ATC AOO AAA 5*53 

Olo Cys Olo Ola Ila Ila Arg Lys 

240 345 

OOA OOT OAA ACA TCT OAA ATO TAT CTC ATT CAA CCT OAC AOT TCT OTC 590 1 

Oly OIr Olo Thr Sor Ols Mot Tyr Los III Ola Pro Asp Sor Sar Val 

230 255 260 

AAA CCO TAT A OA OTA TAC TOT OAC ATO AAT ACA OAA AAT OOA O 5944 

Lys Pro Tyr Arg Val Tyr Cys Asp Mot Asa Tar Ola Asa Oly 

265 270 275 

OT A AOCTTTC OACAOTTOTT OACCTOTTOA TCTOTAATTA TTTOOATACC OTAAAATOCC 6004 

AOOAAACAAO OCCAOOTOTO OTOOCTCATA CCTOTAATTC CAOCACCTTO OOAOOCCAAA 6064 

OTOOOCTOAT AOCTTOAOCC TAOOAOTTTO AAACTAOCCT OOOCAACATA ATOAOACCCT 6 134 

AACTCTACAA AAAAAAAAAA AATACCAAAA AAAAAAAAAA AATCAOCTOT OTTOOTAOTA 6164 

TOTOCCTOTA OTCCCAOCTA TCCAOOAOOC TOAOATOOOA OATCACCTOA OCCCACAACC 6344 

TOOAOTCTTO ATCATOCTAC TOAACTOTAO CCTOOOCAAC AOAOOATAOT OAOATCCTOT 6304 

CTCAAAAAAA AAAATTAATT AAAAAOCCAO OAAACAAOAC TTAOCTCTAA CATCTAACAT 6364 

AOCTOACAAA OOAOTAATTT OATOTOOAAT TCAACCTOAT ATTTAAAAOT TATAAAATAT 6434 

CTATAATTCA CAATTTOOOO T A AO AT A AAO CACTTOCAOT T TCC AAAOAT TTTAC AAOTT 6414 

TACCTCTCAT ATTTATTTCC TTATTOTOTC TATTTTAOAO CACCAAATAT ATACTAAATO 6544 

OAATOOACAO OOOATTCAOA TATTATTTTC AAAOTOACAT TATTTOCTOT TOOTTAATAT 6604 



39 



40 



-continued 



ATOCTCTTTT TOTTTCTOTC AACCAAAO OA TOO ACA OTO ATT CAO AAC COT 

Oly Ttp Tir Val II* Ola Aia Art 
210 2t5 

CAA OAC OOT AOT OTT OAC TTT OOC AOO AAA TOO OAT CCA TAT AAA CAO 
Ola Aip Oly Ser Val Aip Pbe Oly Ar§ Lyi Ttp Aip Mo Tyt Lya Ola 
290 295 300 

OOA TTT OOA AAT OTT OCA ACC AAC ACA OAT OOO AAO AAT TAC TOT OOC 
Oly Pbe Oly Asa Val Ala Tbi Aia Thr A.p Oly Ly» Aia Tyt Cyi Oly 

3 05 3 10 515 

CTA CCA O OTAACOAACA OOCATOCAAA ATAAAATCAT TCTATTTOAA ATOOOATTTT 

TTTTAATTAA AAAACATTCA TTOTTOOAAO CCTOTTTTAO OCAOTTAAOA OOAOTTTCCT 

OACAAAAATO TOOAAOCTAA AO AT A AOO OA AOAAAOOCAO TTTTT AOTTT CCCAAAATTT 

TATTTTTOOT OAOAOATTTT ATTTTOTTTT TCTTTT AO OT OA A TAT TOO CTT 

Oly Oil Ty i Tip La* 

3 2 0 

OOA AAT OAT AAA ATT AOC CAO CTT ACC AOO ATO OOA CCC ACA OA A CTT 
Oly Asa A.p Lyi lie Ser Ola Lao Thr Arg Mat Oly Pro Tb t Olo Lee 



3 2 5 



3 3 0 



3 3 5 



3 4 0 



TTO ATA O A A ATO OAO OAC TOO AAA OOA OAC AAA OTA AAO OCT CAC TAT 
Leo Me Olo Met Olo Asp Tip Lyi Oly Aip Lye Val Lyi Ala Hii Tyr 



3 45 



3 5 0 
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OOA OOA TTC ACT OTA CAO AAT OA A OCC AAC AAA TAC CAO ATC T C A OTO 



Oly Oly Pbe Thr Val 

3 6 0 



At* Olo Ala Aia Lyt Tyt Ola lie Sor Vol 



3 6 5 



370 



AAC AAA TAC AO A OOA ACA OCC OOT AAT OCC CTC ATO OAT OOA OCA TCT 
An Lyi Tyr Art Oly Thr Ala Oly Aia Ala Lao Mat Aip Oly Ala Sor 



3 75 



3 t 0 



3 t 5 



CAO CTO ATO OOA O A A AAC AOO ACC ATO ACC ATT CAC AAC OOC ATO TTC 
Ola Leo Met Oly Olo Aia Ai| Thr Met Thr lie Hie Aia Oly Mot Pbe 



3 9 0 



3 9 5 



TTC AOC ACO TAT OAC AO A OAC AAT OAC OOC 
Pbe Set Tbr Tyr Aip Arg Aip Aia Aip Oly 



4 0 0 

TO OTATOTOTOO 
Tr p 



665 5 

6 70 9 

6 73 1 

6101 
6 »6 S 

6 9 2 » 
6910 

7 0 2 t 

70 7 6 
7 12 4 
7 17 2 
72 2 0 
7 2 6 2 



4 0 5 


4 1 0 


4 1 


3 






CACTCTTTOC 


TCCTOCTTTA 


AAAATCACAC 


T A AT ATC ATT 


ACTCAOAATC 


ATT AACAATA 


7 3 2 2 


TTTTT AAT AO 


CT ACC ACTTC 


CTOOOCACTT 


ACTOTCAOCC 


ACTOTCCTAA 


OCTCTTTATO 


73 S 2 


C ATCACTCOA 


AAOC ATTTCA 


ACTAT AAOOT 


AOACATTCTT 


ATTC TCATTT 


TACAOATOAO 


7 44 2 


ATTT AOAOAO 


ATTACOTOAT 


TTOTCCAATO 


TCACACAACT 


ACCCAOAOAT 


AAAACTAOAA 


7 5 0 2 


T TTO AOC A C A 


OTT ACTTTCT 


OAATAATOAO 


CATTTAOATA 


AATACCTATA 


TCTCT AT ATT 


7562 


CTAAAOTOTO 


TOTOAAAACT 


TTCATTTTCA 


TTTCCAOOOT 


TCTCTOATAC 


TAAOOOTTOT 


762 2 


AAAAOCT ATT 


ATTCCAOTAT 


AAAOTAACAA 


ACACAOTCCC 


TAOATOOATT 


OCCACAAAOO 


76*2 


CCCAOTT ATC 


TCTCTTTCTT 


OCTATAOOOC 


ACAOOAOOTC 


TTTOOTOTAT 


TAOTOTOACT 


7742 


CTATOTATAO 


C ACCCAAAOO 


AAAOACTACT 


OTOCACACOA 


OTOTAOCAOT 


CTTTTATOOO 


7102 


TAATCTOCAA 


AACGTAACTT 


OACCACCOT A 


OTTCTOTTTC 


TAATAACOCC 


AAACACATTT 


7162 


TCTTTCAO O 


TTA ACA TCA 


, OAT CCC AO A AAA CAO 


TOT TCT AAA 


OA A OAC 


79 10 




Leo Thr Sit 


Aip Pro Arg Lyi Ola 
4 2 0 


Cyi Sar Lyi 
425 


Olo Aip 





OOT OOT OOA TOO TOO TAT AAT AO A TOT CAT OCA OCC AAT CCA AAC OOC 
Oly Oly Oly Ttp Ttp Tyt Aia Arg Cyi Hii Ala Ala Aia Pro Aia Oly 
430 433 440 



79 5 % 



AOA TAC 
Arg Tyr 



TAC TOO OOT OOA CAO TAC 
Tyr Trp Oly Oly Ola Tyr 



ACC TOO OAC ATO OCA AAO CAT OOC 
Tbr Trp Aip Met Ala Lyi Hii Oly 



• 00 6 



41 



42 



-continued 



44 3 



450 



4S5 



AC A OAT OAT OOT OTA OTA TOO ATO AAT TOO AAO 
Tbr Aap Asp Oly Val Vat Tip Mat An Trp Lya 
465 470 

ATO AOO AAO ATO AOT ATO AAO ATC AOO CCC TTC 
M« t Art Lya Sar Mat Lyt 11a Art F » • Faa 

4S0 413 

TACOTAOATT TTTOCTCTTC TOTATOTOAC AACATTTTTO 

TTCTTTCATA CATTATATTC CTCTAAAACT CTCAAOCAOA 

AAAAOTATAO OATAAATTAC ATTAAAATAO CACATOATTT 

TCTTOCTCAC CCAAOAAOTA ACAAAAOTAT AOTTTTOACA 

AOTTCTAOTT OATTOCOAOA ATTTTCAAAT AAOOAAOAOO 

OOAAAACCAT OACOOAAAOO AAAAACTOAT OTTTAAAAOT 

ATTTATOTAO OATCTOTCAA AOAAAACTTC CAAAAAOATT 

OTTOCAATAA OTTAATOTTT TCTTOTTTTO T A AT CC AC AC 

CACTTOTAAO OAAOOAOAAO COTTCACAAC CTCAAATAOC 

TTTOAAOATT TAAAATCTOA CTCTAOOACO OOCACOOTOO 

ACTTTOOOAO OCTOAOOCOO OCOOTCACAA OOTCAOOAOT 

ATOOTOAAAC CCCATCTCTA CTAAAAATAC AAAAATT AOC 



460 

OOO TCA TOO TAC T C A 
Oly Sar Trp Ty r Sar 
475 



• 0 5 4 



TTC CCA CAO CAA TAOTCCCCI109 
Paa fro Ola Ola 
490 



TACATTATOT TATTOOAATT 
COTOAOTOTO ACTTTTTOAA 



TOTAOOTCCC AOCTAOCCTO TOAOOTOOAO ATTOCATTOA OCCAAOATC 



• 169 

• 3 2 9 

TCTTTTOTTT TCTTCATTTC •3»9 
OAOTTOOTOT TCATAATTTC 1)49 
OOTCTTTTAT CCTTOTCOTA 1409 
CCACTTTTAA AACTAT ATTT S469 
TATTAATTAA ACCAOACTCT SS29 
ATTC A ATO AO TTAOOCTT TO S5S9 
TAATAAACCO OTCTTOAATA 1649 
CTCACOACTA TAATCCCAAC S709 
TCAAOACCAO CCTOACCAAT «769 
CAOOCOTOOT OOCAOOTOCC 1129 

• • 7 » 



( 2 ) INFORMATION FOR SBQ ID NO*: 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH: 491 m nil 
( B ) TYPE: amao and 
( D ) TOPOLOGY: bmcm 

( i i )MOLECUL£TYPE:rnau 

( i i ) SEQUENCE DESCRIPTION: SBQ ID NO*: 

M • i Lyi Art Mat Val Sar Trp Sar Pbe 



lit L«a Lea Laa Lao Laa La 

2 0 



a C y • 



Va 1 

25 



H i a 
1 0 



Paa 



Lyt Laa 
Laa Val 



Lyt 
Ly • 



Tbr Mat Lya 

1 5 

Sar Ola Oly 

5 0 



Val Aia Aip Aaa Ola Ola Oly 

3 5 



Paa 

4 0 



Paa Sar Ala Art 



L«o Aip Ly* Lyi Art Ola Ola Ala Pra Sar Laa 



30 



5 5 



Art 

6 0 



Oly 
4 5 

Pro 



Hit Art *r e 
Ala Pre Pre 



Pro lit Sar Oly Oly Oly Tyr 



6 5 



70 



Thr Ola Lyt Lyi Val Ola Art 
S 5 

Hit Ala Asp Pro A a p Laa Oly 

I 0 0 



Art 

Ly . 
Va 1 



Ala Art 



A 1 a 



Laa 

1 0 3 



Pr o 
9 0 



P r o 

73 

Aip 



A 1 a 
A 1 a 



Cm Pro Tar 



Lyt 

Oly 
Oly 



Ala Ala Ala 
• 0 

Oly C y • Lav 
95 

Cya Ola Laa 

1 1 0 



Ola Olo Ala Lao Lao Ola Ola 

1 1 5 



0 1 a 

1 2 0 



A r | Pro 1 1 i Art 



Olo Lao Aia Aia Aia Val Ola Ala Val Sar Ola 

13 0 13 3 



Aaa 

1 2 5 



Tbr 
1 4 0 



Sar Val A ■ p 



Sar Sar Sar Sar 



Pba Ola Tyr Mat Tyr Loo Lao Lyt Aip Laa 



Trp 
1 5 5 



Ola Lyt 



Art Ola Lyi 

1 6 0 



Ola Val Lyt Atp Aaa Olo Ata 

1 6 5 



Val Val 



A a a 

1 7 0 



Ola Tyr Sar 



Sar Olo Lao 

1 75 



43 
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-continued 



Ola Lyt Bit Ola L • q 
1 I 0 



Tyx 



I 1 • Asp Oil 
1 S3 



Tbr Val 



Asa Sex Ata 

190 



II* Pro 



Th x At a Lea Ax g Val 

1 95 



Lea 



II* Ola Lyi Leo Ol 

2 1 0 



a 8 • x 



Tbx Pro Cyi Tax Val Sex 

2 2 5 2 3 0 

Cyi Ola Ola lit II* Axg 
2 4 5 



Axg Sax tla 

2 0 0 

Atp Val Sax 

2 15 

Cy§ Ata II* 

Ly • Ol y O 1 y 



Lea Ola 



Ala Ola 



Pxo 



O I a 
2 5 0 



Va 1 

2 3 5 



Ata L*a Axg 

205 

M* t Ola Ty x 

2 2 0 

Val S * x Ol y 



Th r Sex Ola Me t 



Sax 
Cy t 
Ly • 



Ty x 
255 



ty * 
Axg 



Ola 
240 



L * a 



lie Ola Pxo Atp S*x S*x 
2 6 0 

Ata Tbx Ola Ata Oly Oly 

2 7 5 

S«x Val Atp Pfa* Oly Axg 

2 90 



Val L y • Pxo 
2 6 5 

Tx p Tbx Val 
2 10 

Lyt Txp Atp 

2 9 5 



Ty x Axg 



II* Ola 



P t e T y x 



Val Tyx Cyt 
2 7 0 

Ata Arg Ola 
2 a 5 

Lyt Ola Oly 
3 0 0 



Atp 
Atp 
Pba 



Mo t 

Oly 
Oly 



Ata Val Ala Tbx Ata 

3 0 5 



Tb x 

3 1 0 



Atp Oly Lyt Ata 



Tyx 
3 1 5 



Cyt Oly Lao Pxo 



Oly 
3 2 0 



Ols Tyr Txp Loo Oly 
3 2 5 



Ata Atp Lyt II* 



s • x 
3 3 0 



Ola L*a Tbx Axg 



Me t 

3 3 5 



Oly 



Pxo Tbx Ola L*a L*o 
3 40 

Lyt Ala Bit Tyx Oly 

3 5 5 



II* Ola Met Ola 
3 4 5 

Oly Pb* Tbx Val 
3 6 0 



Atp Txp 



Lyt Oly Atp 
3 5 0 



Ola Ata Ola 



Ala A t i 

3 65 



Lyt Val 
Lyt Tyx 



Ola II* Ser Val Ata 

3 70 



Lyt Tyr Axg Oly 

3 7 5 



Tb r Ala 



Oly Ata Ala 
3 10 



L • a Met 



Atp Oly Ala Sax Ola 

3 8 5 



L«a Met Oly Ola 

3 90 



Arg 

3 9 5 



Tbx M* t Tbx II* 



Bl t 
4 0 0 



Ata Oly Mel Pb* Pb* 
4 0 5 



Ser Tbr Tyr Atp 



Arg 

4 1 0 



Atp Ata Atp Oly 



Txp 
4 1 5 



L • a 



Tbr Ser Atp Pro Axg 
4 2 0 



Lyt Ola Cyt Ser 
4 2 5 



Lyt Ola 



Txp Tyr 



4 3 3 



Oly Oly Ola Tyx Tbr 
4 5 0 

Val Val Txp Met Ata 

46 3 

Ser Met Lyt lit Axg 
4 t 5 



4 4 0 

Txp Alp Met Ala Lyt Hit 
4 5 5 



Atp Oly Oly 
4 3 0 

Oly Axg Tyx 
4 4 5 

Oly Tbr Atp 
4 6 0 



Txp Lyt Oly Ser Trp Tyx Sex Met Axg 

4 7 0 4 7 5 

Pro Pbe Pbe Pro Ola Ola 

490 



Oly 
Tyx 
Atp 

Ly t 



Txp 
Txp 
Oly 



Me t 
4 S 0 



( 2 ) INFORMATION FOR SBQ ID N03: 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH: 10564 batepan 
( B ) TYPE: aadac acid 
( C ) SIRANDETJNESS: daobb 
( D )TOf>QLOOY:&a*ar 

( i i ) MOCPfTTLE TYPE: DNA (j iinii) 

< ▼ i i ) IMMEDIATE SOURCE: 

( i i ) FEATURE: 

( A ) NAME/KEY: CDS 

( B ) LOCATION: j«a(1799.1l7©, I973JD17. 2207-2390, 2310 
. . 2603. 421LXM1. 4645-4771, 57S*-5942. 7426 
. . 7703, 9342-S571) 



45 46 

-continued 

( s i )SBQUBNCBDGSC3UPI10N:SBQIDNO5: 

CTACACACTT CTTOAAOOCA AAOOCAATOC TOAAOTCACC TTTCATOTTC AAATCATATT 60 

AAAAAOTT AO CAAOATOTAA TTATCAOTOT ACTATOTAAA TCTTTOTOAA TOATCAATAA 120 

TTACATATTT TC ATT AT AT A TATTTTAOTA OAT A AT ATT T ATATACATTC AACATTCTAA ISO 

ATATAOAAAO TTTACAOAOA AAAATAAAOC CTTTTTTTCC AATCCTOTCC TCCACCTCTO 340 

CATCCCATTC TTCTTCACAO AOOCAACTOA TTCAAOTCAT TACATAOTTA TTOAOTOTTA 3 00 

ACTACAACTA TOTTAAOTAC AOCTATATAT OTTAOATOCC OTAOCCACAO AAATCAOTTT 360 

ACAATCTAAT OCAOTOOATA CAOCATOTAT ACATATAATA TAAOOTTOCT ACAAATOCTA 420 

TCTOAOOTAO AOCTOTTTOA AAOAATACTA ATACTTAAAT OTTTAATTCA ACTOACTTOA 4t0 

TTOACAACTO ATT AOCTOAO TOOAAAAOAT OGATOAOAAA OAT TO TO AO A CTTAATTOOC 340 

TOOTOOTATO OTOATATOAT TOACAATAAC TOCTAAOTCA OAOAOOOATA TATTAAOOAO 600 

OAOAAOAAAA OCAACAAATC TOOTTTTOAT OTOTTCACTT TOTTATAATT ATTOATTATT 640 

TACTOAATAT OAATATTTAT CTTTOTTTTT OAOTCAATAA AT AT ACCTTT OTAAAOACAO 720 

AATTAAAOTA TTAOTATTTC TTTCAAACTO OAOOCATTTC TCCCACTAAC ATATTTCATC 7»0 

AAAACTTATA ATAAOCTTOO TTCCAOAOOA AOAAATOAOO O AT AACC AAA AATAOAOACA 640 

TTAATAATAO TOTAACOCCC AOTOATAAAT CTCAATAOOC AOTOATOAC A OACATOTTTT 900 

CCCAAACACA AOOATOCTOT AAOOOCCAAA CAOAAATOAT OOCCCCTCCC CAOCACCTCA 960 

TTTTOCCCCT TCCTTCAOCT ATOCCTCTAC TCTCCTTTAO ATAC AAOOOA OOTOOATTTT 1020 

TCTCTTCTCT OAOATAOCTT OATOOAACCA CAOOAACAAT OAAOTOOOCT CCTOOCTCTT 1060 

TTCTCTOTOO CAOATOOOOT OCCATOCCCA CCTTCAOACA A A OO OA AO AT TOAOCTCAAA 1140 

AOCTCCCTOA OAAOTOAOAO CCTATOAACA TOOTTOACAC AOAOOOACAO OAATOTATTT 1200 

CCAOOOTCAT TCATTCCTOO OAATAOTOAA CTOOOACATO OOOOAAOTCA OTCTCCTCCT 1260 

OCCACAOCCA CAOATTAAAA ATAATAATOT TAACTOATCC CTAOOCTAAA ATAATAOTOT 1320 

TAACTOATCC CTAAOCTAAO AAAOTTCTTT TOOTAATTCA OOTOATOOCA OCAOOACCCA 13 SO 

TCTTAAOOAT AOACTAOOTT TOCTT AO TTC OAOOTCATAT CTOTTTOCTC TCAOCCATOT 1440 

ACTOOAAOAA OTTOCATCAC ACAOCCTCCA OOACTOCCCT CCTCCTCACA OCAATOOATA 1500 

ATOCTTCACT AOCCTTTOCA OATAATTTTO OATCAOAOAA AAAACCTTOA OCTOOOCCAA 1560 

AAAOOAOOAO CTTCAACCTO TOTOCAAAAT CTOOOAACCT OACAOTATAO OTTOOOOOCC 1620 

AOOATOAOOA AAAAOOAACO OOAAAOACCT OCCCACCCTT CTOOTAAOOA OOCCCCOTOA 1660 

TCAOCTCCAO CCATTTOCAO TCCTOOCTAT CCCAOOAOCT TACATAAAOO OACAATTOOA 1740 

OCCTOAOAOO TOACAOTOCT OACACTACAA OOCTCOOAOC TCCOOOCACT CAOACATC 1796 

ATO AOT TOO TCC TTO CAC CCC COO A A T T T A ATT CTC TAC TTC TAT OCT 1646 

M« t S«r Trp Sar Laa Hit Pro Af | Aia Laa lie Laa Tyr Paa Tyi Ala 

15 10 15 

CTT TTA TTT CTC TCT TCA ACA TOT OTA OCA OTAAOTOTOC TCTTCACAAA 1196 
L«a L • v Pb * Laa Sax Sar Tbi Cyt Vil All 

2 0 2 5 

ACOTTOTTTA AAATOOAAAO CTOOAAAATA AAACAOATAA TAAACTAOTO A AATTTTCOT 1956 

ATTTTTTCTC TTT T AO TAT OTT OCT ACC AO A OAC AAC TOC TOC AT C TTA 2005 

Tyr Vi I Alt Tar Arg A»p Ata Cya Cy i 1 1 • Ln 

3 0 3 5 

OAT OA A AO A TTC OTAAOTAOTT TTTATOTTTC TCCCTTTOTO TOTOAACTOO 2057 
A • p O 1 o Art Pb c 
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-continued 

4 0 

AOAOOOOCAO AOOAATAOAA ATAATTCCCT CATAAATATC ATCTOOCACT TO T AACTTTT 3117 

TAAAAACATA OTCTAOOTTT TACCTATTTT TCTTAATAOA TTTT AAOAOT AOCATCTOTC 3 177 

TACATTTTTA ATCACTOTTA TATTTTCAO OOT AOT TAT TOT CCA ACT ACC TOT 3230 

Ol 7 Ser Ty i C y t Pro Tbr Tbr Cys 
4 3 

OOC ATT OCA OAT TTC CTO TCT ACT TAT CAA ACC AAA OTA OAC A AO OAT 2271 

Oly lie Ala Asp Pbc Lao Ser Tbi Tyr Ola Tbr Lys Val Asp Lyi Asp 

50 5 5 60 65 

CT A CAO TCT TTO OA A OAC ATC TTA CAT CAA OTT OA A AAC AAA ACA TCA 33 26 

Leo Ola Ser Lao Olo Atp I 1 i Leo Bit Ola Val Ola Asa Lys Tbr Ser 
7 0 7 5 SO 

OA A OTC AAA CAO CTO ATA AAA OCA ATC CAA CTC ACT TAT A A T CCT OAT 3 3 74 

Olo Val Lys Ola Lao II* Lys Ala III Ola Lao Tbr Tyr Asa Pro Aap 
> 5 90 95 

OA A TCA TCA AAA CCA A OTOAOAAAAT AAAOACTACT O AC C AAA A A A 3 420 

Ola Ser S«r Lys Pro 
1 0 0 

^ TAATAATAAT AATCTOTOAA OTTCTTTTOC TOTTOTTTTA OTTOTTCTAT TTOCTTAAOO 2410 

*Q ATTTTTATOT CTCTOATCCT ATATTACAO AT ATO ATA OAC OCT OCT ACT TTO 2532 

^ Asa M*tll« Asp Ala Ala Tbr Lao 

HI A AO TCC AOO ATA ATO TTA OA A OA A ATT ATO AAA TAT OA A OCA TCO ATT 25 SO 

J- Lys Ser Arg lie Met Lao Olo Olo 11a Mai Lys Tyr Olo Ala Sir lie 

115 12 0 12 5 

^ TTA ACA CAT OAC TCA AOT ATT CO OTAAOOATTT TTOTTTTAAT TTOCTCTOCA 2633 

^ Leo Tbr His Asp Sar Sor lis Are, 

g: 13 0 

O AOACTOATTT AOTTTTTATT TAATATTCTA TACTTOAOTO AAAOTAATTT TTAATOTOTT 2693 

TTCCCCATTT ATAATATCCC AOTOACATTA TOCCTOATTA TOTTOAOCAT AOTAOAOATA 2753 

l n OAAOTTTTTA OTOCAATATA AATTATACTO OOTTATAATT OCTTATTAAT AATCACATTO 2113 

■.Q AAOAAAOATO TTCTAOATOT CTTCAAATOC TAOTTTOACC ATATTTATCA AAAATTTTTT 2S73 

kQ CCCCATCCCC CATTTATCTT ACAACATAAA ATCAATCTCA TAOOAATTTO OOTOTTOAAA 2933 

ATAAAATCCT CTTTATAAAA ATOCTOACAA ATTOOTOOTT AAAAAAATTA OCAAOCAOAO 2993 

OCATAOTAAO OATTTTOOCT CCTAAAOTAA ATTATATTOA ATOTOOAOCA OOAAOAAACA 305 3 

TOTCTTOAOA OACTAAOTOT OOCAAATATT OCAAAOCTCA TATTOATCAT TOCAOAATOA 3 113 

ACCTOCATAO TCTCTTCCCT TCATTTOOAA OTOAATOTCT CTOTTAAAOC TTCTCAOOOA 3 173 

CTCATAAACT TTCTOAACAT AAOOTCTCAO ATACAOTTTT AATATTTTTC CCCAATTTTT 3233 

TTTTCTOAAT TTTTCTCA A A OCAOCTTOAO AAATTOAOAT AAATAOTAOC TAOOOAOAAO 3293 

TOOCCCAOOA AAOATTTCTC CTCTTTT TOC TATCAOAOOO C CCTTOTT AT TATTOTTATT 3 3 5 3 

ATTATTACTT OCATTATTAT TOTCCATCAT TOAAOTTOAA OOAOOTTATT OTACAOAAAT 34 13 

TOCCTAAOAC AAOOTAOAOO OAAAACOTOO ACAAATAOTT TOTCTACCCT TTTTTACTTC 3473 

AAAOAAAOAA COOTTTATOC ATTOTAOACA O TTTT CT ATC ATTTTTOOAT ATTTOCAAOC 3 5 3 3 

CACCCTOTAA OTAACTACAA AAOOAOOOTT TTT ACTTCCC CCAOTCCATT CCCAAAOCTA 3593 

TOTAACCAOA AOCATTAAAO AAOAAAOOOO AAOTATCTOT TOTTTTATTT TACATACAAT 3653 

AACOTTCCAO ATCATOTCCC TOTOTAAOTT ATATTTTAOA TTOAAOCTTA TATOTATAOC 3713 

CTCAOTAOAT CCACAAOTOA AAOOTATACT CCTTCAOCAC ATOTOAATTA CTOAACTOAO 3773 
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-continued 

CTTTTCCTOC TTCTAAAOC A TCAOOOOOTO TTCCTATTAA CCAOTCTCOC CACTCTTOCA 3 S3 3 

OOTTOCTATC TOCTOTCCCT TATOC AT A AA OTAAAAAOCA AAATOTCAAT OACATTTOCT 3S93 

TATTOACAAO GACTTTOTTA TTTOTOTTOO OAOTTOAOAC AATATOCCCC ATTCTAAOTA 3953 

AAAAOATTCA OOTCCACATT OTATTCCTOT TTTAATTOAT TTTTTOATTT OTTTTTCTTT 40 13 

TTCAAAAAOT TTATAATTTT AATTCATOTT AATTTAOTAA TATAATTTTA CATTTTCCTC 4073 

AAOAATOOAA TAATTTATCA OAAAOCACTT CTT AAOAAAA TACTTAOCAO TTTCCAAAOA 4 133 

AA AT AT AAA A TTACTCTTCT OAAAOOAATA CTTATTTTTO TCTTCTTATT TTTOTTATCT 4193 

TATOTTTCTO TTTOT AO A TAT TTO CAO O AA ATA TAT AAT TCA AAT AAT CAA 4244 

Tyr Lav Ola Oil 11* Ty i Ata Sar Ata Ata Ola 

13 3 140 145 

A AO ATT OTT AAC CTO AAA OAO A AO OTA OCC CAO CTT OAA OCA CAO TOC 4292 

Lyi I 1 « Val Asa Laa Ly i Ola Lyi Val Ala Ola Laa Ola Ala Ola Cy • 
15 0 15 5 160 

CAO OAA CCT TOC AAA OAC ACO OTO CAA ATC CAT OAT ATC ACT OOO AAA O 434 1 

Ola Ola Pro Cyi Lyi Aap Tbr Val Ola Ila Bit Asa 11a Tat Ol y Ly i 

16 5 17 0 17 5 

OTAACTOATO AAOOTTATAT TOOOATTAOO TTCATCAAAO TAAOTAATOT AAAOO AO A AA 440 1 

OTATOTACTO OAAAOTATAO OAATAOTTTA OAAAOTOOCT ACCCATTAAO TCTAAOAATT 446 1 

TCAOTTOTCT AOACCTTTCT TOAATAOCTA AAA AAA AC AO TTT AAAAOOA ATOCTOATOT 45 2 1 

OAAAAOTAAO AAAATTATTC TTOOAAAATO AATAOTTTAC TACATOTTAA AAOCT ATTTT 4511 

TCAAOOCTOO CACAOTCTTA CCTOCATTTC AAACCACAOT AAAAOTCO AT TCTCCTTCTC 464 1 

T AO AT TOT CAA OAC ATT OCC AAT A AO OOA OCT AAA CAO AOC OOO CTT 46 St 

Aip Cyi Ola Aip 11* Ala Aia Lya Oly Ala Ly t Ola Sar Oly Laa 
ISO 1 S 5 190 

TAC TTT ATT AAA CCT CTO AAA OCT AAC CAO CAA TTC TTA OTC TAC TOT 4736 

Tyr P fa • 11* Lyi Pre Laa Lyi Ala An Ola Ola P a • Lav Val Tyr Cyi 

195 200 305 

OAA ATC OAT OOO TCT OOA AAT OOA TOO ACT OTO TTT CAO A AO 477S 

OId I 1 • A*p Oly Sar Oly Aim Oly Trp Tar Val Phi Ola Lyi 

2 10 2 13 2 2 0 

OTAATTTTTT CCCCACCATO TGTATTTAAT AAATTCCTAC ATTOTTTCTO CCATATOOCA 4131 

O AT ACT TTTC TAAOCACCTT OTOAACCOTA OCTCATTTAA TCCTTOCAAT AOCCCTAAOA 4191 

OOAAOOTACT T CTOTT ACT C CTATTTACAO AAAAOOAAAC TOAOOCACAC AAOOTTAAAT 495 S 

AACTTOCCCA AOACCACATA ACTAATAAOC AACAOAOTCA OCATTTOAAC CTAOOCAOTA 50 IS 

TAOTTTCAOA OTTTOTOACT TOACTCTATA TTOTACTOOC ACTOACTTTO TAOATTCATO 507S 

OTOOCACATA ATCATAOTAC CACAOTOACA AATAAAAAOA AOOAAACTCT TTTOTC AOOT 5 13S 

AOOTCAAOAC CTOAOOTTTC CCATCACAAO ATOAOOAAOC CCAACACCAC CCCCCACCAC 5 19S 

CCCACCACCA TCACCACCCT TTCACACACC AOAOOATACA CTTOOOCTOC TCCAAOACAA 5 25S 

OOAACCTOTO TTOCATCTOC CACTTOCTOA TACCCACTAO OAATCTTOOC TCCTTT ACTT 53 IS 

TCTOTTTACC TCCCACCACT OTTATAACTO T TTCT AC AOO OOOCOCTCAO AOOOAATOAA 3 3 7 S 

TOOTOOAAOC ATTAOTTOCC AOACACCOAT TOAOCAATOO OTTCCATCAT AAOTOTAAOA 543 S 

ATCAOTAATA TCCAOCTAOA OTTCTOAAOT COTCTAOOTO TCTTTTT A AT ATTACCACTC 349S 

ATTTAOAATT TATOATOTOC CAOAAACCCT CTTAAOTATT TCTCTT ATAT TCTCTCTCAT 555S 

OATCCTTOCA OCAACCCTAA OAAOTAACCA TCATTTTTCC TATTTOATAC ATOAOOAAAC 56 IS 

TOAOOTAOCT TOOCCAAOAT CACTTAOTTO OOAOTTOATA OAACCAOTOC TCTO T ATTTT S67S 

TOACAAAATO TTOACAOCAT TCTCTTTACA TOCATTOATA OTCT ATTTTC TCCTTTTOCT 573 S 



51 



52 



-continued 



CTTOCAAATO TOT AATT AO AOA CTT OAT OOC AOT OTA OAT TTC AAO AAA AAC 

Arg L«o A»p Oly Sar Vil Atp Pbe Ly • Ly • Aea 

325 230 

TOO ATT CAA TAT AAA OAA OOA TTT OOA CAT CTO TCT CCT ACT OOC ACA 

Trp II* Ola Tyr Ly i Ola Oly Pbe Oly. Hit Lav Sar fro Tar Oly Tar 
235 240 245 

ACA OAA TTT TOO CTO OOA A A T OAO AAO ATT CAT TTO ATA AOC ACA CAO 

Tbr OIo Phi Trp Lea Oly Ate Ola Ly i Xla Hat Lit I I • Sar Tbr Ola 

250 255 260 265 

TCT OCC AT C CCA TAT OCA TTA AOA OTO OAA CTO OAA OAC TOO AAT OOC 

Sar Ala lie Pra Tyr Ala Lea Arg Val Oln Lid OIo A • p Trp A • a Oly 

270 375 2 10 



AOA ACC 
Arg Tbr 



AO OTACTOTTTT OAAATOACTT CCAACTTTTT ATTOTAAAOA 
Sar 



TTOCCTOOAA TOTOCACTTT CCAACTATCA ATAOACAATO OCAAATOCAO CCTO ACAAAT 

OCAAACAOCA CATCCAOCCA CCATTTTCTC CAOOAOTCTO TTTOOTTCTT OOOCAATCCA 

AAAAOOTAAA TTCTATTCAO OATOAATCTA AOTOTATTOO TACAATCTAA TTACCCTOOA 

ACCATTCAOA OTAATAOCTA ATTACTOAAC TTTTAATCAO TCCCAOOAAT TOAOCATAAA 

ATTATAATTT TATCTAOTCT AAATTACTAT TTCATOAAOC AOOTATTATT ATTAATCCCA 

TTTTATAOAT TAACTTOCTC AAAOTCACAT TOCTOATAAO TOOTAOAOOT AO AA TTC AO A 

CTCAAOTAOT TTAACTTTAO AOCCTOTCCT CTTAACAACT ATCCTOOTTO AAA AOC A AAT 

ACAOCCTCTT CAOACTTCTC AOTOCCTTOA TOOCCATTTA TTCTOTCAAA TCATOAOCTA 

CCCTAAAAOT AAACCAOCTA OCTCTTT TO A TOATCTAOAO OCTTCTTTTT OCTTOAOATA 

TTTOAAOOTT TTAAOCATTO TTACCTAATT AAAATOCAOA AAAATATCCA ACCCTCTTOT 

TATOTTTAAO OAATAOTOAA ATATATTOTC TTCAAACACA TOOACTTTTT TTT ATTOCTT 

OOTTOOTTTT TAATCCAOAA AOTOCTATAO TCAOTAOACC TTCTTCTAOO AAAOOACCTT 

CCATTTCCCA OCCACTOOAO ATTAOAAAAT AAOCTAAATA TTTTCTOOAA ATTTCTOTTC 

AT TCATTAAO OCCCATCCTT TCCCCCACTC TATAOAAOTO T T O T CCA CTT OCACAATTTT 

TTCCAOOAAA OAATCTCTCT AACTCCTTC A OCTCACATOC TTTOOACCAC ACAOOOAAOA 

CTTTO ATTOT OTAATOCCCT CAOAAOCTCT CCTTCTTOCC ACTACCACAC TOATTTOAOO 

AAOAAAATCC CTTTAOCACC TAACCCTTCA OOTOCTATOA OTOOCTAATO OAACTOTACC 

TCCTTCAAOT TT TOTOC A A T AATTAAOOOT CACTCACTOT CAOATACTTT CTOTOATCTA 

TOATAATOTO TOTOCAACAC ATAACATTTC AATAAAAOTA OAAAATATOA AATTAOAOTC 

ATCTACACAT CTOOATTTOA TCTTAOAATO AAACAAOCAA AAAAOCATCC AAOTOAOTOC 

AATTATTAOT TTTCAOAOAT OCTTCAAAOO CTTCTAOOCC CATCCCOOOA AOTOTTAATO 

AOCTOTOOAC TOOTTCACAT ATCT AT TOCC TCTTOCCAOA TTTOCAAAAA ACTTCACTCA 

ATOAOCAAAT TTCAOCCTTA AOAAACAAAO TCAAAAATTC CAAOOAAOCA TCCTACOAAA 

OAOOOAACTT CTOAOATCCC TOAOOAOOOT CAOCATOTOA TOOTTOTATT TCCTTCTTCT 

CAO T ACT OCA OAC TAT OCC ATO TTC AAO OTO OOA CCT OAA OCT OAC 

Tbi Ala Atp Tyr Ala Met Pbe Ly i Val Oly Pra Ola Ala Aip 

213 290 295 

AAO TAC COC CTA ACA TAT OCC TAC TTC OCT OOT OOO OAT OCT OOA OAT 

Lyi Tyr Arg Lea Tbr Tyr Ala Tyr Pbe Ala Oly Oly Asp Ala Oly Aip 

300 305 310 

OCC TTT OAT OOC TTT OAT TTT OOC OAT OAT CCT AOT OAC AAO TTT TTC 

Ale Pbe Aip Oly Pb* Aip Pbe Oly Aip Atp Pra Sar Aip Lyt Pba Pba 

3 13 320 323 J30 



5 79 0 

5 t 3 t 
5116 
5 9 3 4 

5 9 t 3 

6 04 2 
6 10 2 
6 162 
6223 
626 2 
6542 
6402 
6462 
65 2 2 
65 1 2 
6642 
6702 
6762 

6 9 2 2 
6112 
6942 
7002 
70 6 2 

7 12 3 
7 113 
7 2 4 2 

73 02 
7 3 6 2 

74 2 2 
7 4 6 1 

7 5 16 

7 5 6 4 



53 
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-continued 



AC A TCC CAT A AT OOC ATO CAO TTC AOT ACC TOO 
Tkt Ser Bii Aim Oly Met Ola Pb* Ser Tbr Trp 
935 MO 

A AO TTT OAA OOC AAC TOT OCT OA A CAO OAT OOA 
Lyi Pb« Ola Oly Asa Cyi Ala Olo Ola Asp Oly 

3 5 0 3 5 5 

AAC AAO TOT CAC OCT OOC CAT CTC A AT OOA OTT 
Asa Lys Cys Hit Ala Oly Bis Lao Asa Oly Va 1 
365 370 

OTATOTTTTC CTTTCTTAOA TTCCAAOTTA ATOTATAOTO 

TAATAAATAO ATATOAAOAA ATOAAOAATA ATTTATAAAO 

TTCTTT ATTT CAACTAAOTT CTTTOAAACT OOAAOTOOAT 

AATTAOCCCT TCTAAAOAAA TCCACCTOCT OCAAAATATC 

TOAAACTATC ACCATCATAO CTOOCACTOT OOOTTOTOOO 

ATAAATOATC TOOATOOATT AACATTACTA CATOOATOCT 

TTCCCATOAO CTTTOTOTCA OATACACOCA OTOAACAOOT 

OAOAAOOCAA OCACTOOTAA OOOCAOOOOT TTOTOAAAOC 

OAOOACAOTA OACACTTATT TTAOOATOOO OOTTOOATOA 

AOCTTOOAAT OOTTTOOAAC ACTOOTTTCA CTCACCTACC 

OCCTTACCOA TOCTAAAOOA TCCATOTTAC AATAATOOCA 

TATTCCATOA ATAAAACCAC TATOAAOATA ATCCCACTCA 

OOACAOCAAC ACCACCCTOO OAAAOCCAAA CAOTCAOACC 

OOACTTCCCT ACCATATCTO CTOOOTAOAT OAOTOAAACC 

OCTTOTAOCA AACCATAOTC TCCTC AT CT A CCAAOATOAO 

CTAOCCAATC ACCAACTAOO AAACTTTOCA CAOTTTATTT 

ACAATATTTT TAAATTOOAO AAACATAACT TATCTTTOCA 

AAOAAACTCT AAOOOAAAAT OCTTOATCTO TOTOACCCOO 

OTTCATOCCA OTOTTOTOCT CTOACAAOCC TTTTACAOAA 

TAOOACAAOO AOAAOOCAAA TCAACAOAOO CTOCACTTTA 

ATOCCAOAAC CATTTCCTAA AOCTCCTCAA TCAACCAACA 

CTOAOTTOAC CTCATCAOOA A T TTTO TOOC TCCTTCTCTT 

TOOTCCACAO CAOCTOAOTC COOOATOOAT AAOCTTAOOO 

TTTOOOTTTC TAOCCCTACT AOTAOTOAAT AAATTTAAAO 

ACAOCACAOA TOTTOTTTAA TAATATOTTT ATTTTATAAA 

OOAOATATTT TCAOTTAOCA OATAATACTA TAAATTTT AT 

TAATAOACAO CTCTTCATAO ACTTOCAOAO OTAAAAAOAT 

ATCT ACOACT TOTTTT AO OT OOC ACT TAC TCA AAA 

Oly Oly Tar Tyr S a r Lys 
3 ft 0 

OOT TAT OAT A AT OOC ATT ATT TOO OCC ACT TOO 



OAC AAT OAC AAT OAT 

Asp Asa Asp Asa Asp 
345 

TCT OOT TOO TOO ATO 

Ser Oly Trp Tip Mat 
3 60 



Oly Tyr Asp Asa Oly 
3 9 0 



It 111 Trp Ala Tbr Trp 
395 



TCC ATO AAO AAA ACC ACT ATO AAO ATA ATC CCA 
S«r Mat Lys Lys Tbr Tbr Mat Lys II* Ila Pre 



4 05 



4 1 0 



ATT OOA OAA OOA CAO CAA CAC CAC CTO OOO OOA 



TAT TAC CAA O 
Tyr Tyr Ola 
3 7 5 

TAT ACT ATTT TCATAAAAAA 

ATAOTAOOOA TTTTATCATO 

AAT ACC A AOT TCATOCCT A A 

CAOTAOTTTO OCATTATATO 

ATCTCCTTTA OACATACAAC 

TOTTOACACA TTAACCTOOC 

OTTTOOAOOA AC AO AAT AAA 

T TO AO AO A AO AOACCAOTCT 

OOAOOCTATA OTTTOCTATA 

CAOCAOTTAT OTOTOOOOAA 

TT ATT TOO A A ATCCCAOTOO 

ACAOACTCTC COTTOOAOAA 

AOACCTOTTT AOCATCAOTA 

AOTOTTCCAA ACCACTCCOO 

CAACCTTACC TCCTOATOTC 

AAAOTAACAO TTTOATTTTC 

CTC AC A A ACC ACATAATOAO 

OOCOCCATOC CAOAOCTOTA 

TTACATOAOA TCTOCTTCCC 

AAATOOAOAC ATAAAATAAC 

AAATTOTOCT TT CAA AT AAC 

CTAACCTOCC TOAAOAAAOA 

ACAOAOOCCA ATTAOOOAAC 

TOTOOATOTO ACTATOAOTC 

TTOATATTTT AOO A ATCTTT 

OTAACTOOCA ATOCACTTCO 

TCCAOAATAA TOAT ATOTAC 

OCA TCT ACT CCT AAT 

Ala Sar Tbr Pre Asa 
3 0 3 

AAA ACC COO TOO TAT 

Lys Tbr Arg Trp Tyr 
40 0 

TTC AAC AO A CTC ACA 

Pba Asa Arg Lav Tbr 
4 1 5 

OCC AAA CAO OTC AO A 



76 12 

7660 

7703 

7763 
7 t 23 
7003 
7943 
1003 

• 063 

• 1 23 
t 1 03 
0 243 
0 3 03 

• 3 63 
§423 
I4IS 
0 5 4 3 

• 603 

• 663 

• 723 

• 7 S3 

• •43 

• 903 

• 963 
9 0 2 3 
901) 
9 143 
9 203 
9263 
93 2 3 
9 3 7 3 

9 4 2 1 
9469 
9 5 17 
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-continued 

lie Ol r Ola Oly Ola Ola Hi* Bii Leo Oly Oly Ala Lyi Ola Val Aft 
420 4 25 490 435 

CCA OAO CAC CCT OCO OAA ACA OA A TAT OAC TCA CTT TAC CCT OAO OAT 95 6 5 

Pre Olo Bit Pro Ala Ola Thi Ola Tyr Asp Sar Laa Tyr Pro Ola Atp 
440 445 450 

OAT TTO TAOAAAATTA ACTOCTAACT TCTATTOACC CACAAAOTTT CAOAAATTCT 90 2! 

A • p Laa 

CTOAAAOTTT CTT CCTTTTT TCTCTT ACT A T.ATTTATTOA TTTCAAOTCT TCTATT AAOO 96 t: 

ACATTTAOCC TTC AATOOAA ATTAAAACTC ATTTAOOACT OTATTTCCAA ATTACTOATA 9741 

TCAOAOTTAT TTAAAAATTO TTTATTTOAO OAOATAACAT TTCAACTTTO TTCCTAAATA 9$0 

TATAATAATA AAATOATTOA CTTTATTTOC ATTTTTATOA CCACTTOTCA TTTATTTTOT 9 » 6 

CTTCOTAAAT TATTTTCATT ATATCAAATA TTTTAOTATO TACTTAATAA AATAOOAOAA 992: 

CATTTTAOAO TTTCAAATTC CCAOOTATTT T CCTTOTTT A TTACCCCTAA ATCATTCCTA 99a 

TTTAATTCTT CTTTTTAAAT OOAOAAAATT ATOTCTTTTT AATATOOTTT TTOTTTTOTT 10041 

ATATATTCAC AOOCTOOAOA COTTTAAAAO ACCOTTTCAA AAOAOATTTA CTTTTTTAAA 1010: 

OOACTTTATC TOAACAOAOA OATATAATAT T TTTCCT ATT OOACAATOOA CTTOCAAAOC 1016 

TT C ACT T C AT TTT AAOAOCA AAAOACCCCA TOTTOAAAAC TCCATAACAO TTTTATOCTO 1022 

ATOATAATTT ATCTACATOC ATTTCAATAA ACCTTTTOTT TCCTAAOACT AOATACATOO 1031! 

TACCTTTATT OACC ATTAAA AAACCACCAC TTTTTOCCAA TTTACCAATT ACAATTOOOC 1054] 

AACCATCAOT AOTAATTOAO TCCTC ATTTT ATOCTAAATO TTATOCCTAA CTCTTTOOOA 1040: 

OTTACAAAOO AAATAOCAAT TATOOCTTTT OCCCTCTAOO AOATACAOOA C A A A T AC AOO 1046 

AAAATACAOC AACCCAAACT OACAATACTC TATACAAOAA CATAATCACT AAOCAOOAOT 1052 

CACAOCCACA CAACCAAOAT OCATAOTATC CAAAOTOCAO CTO 10564 

( 1 ) INFORMATION FOR SEQ ID NOA 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH: 453 mauo acidi 
( B ) TYPE: anaao mad 
( D )TOPOLOOY: baear 

< i i ) MOLECULE TYPE: fractu 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NM 

Mat S«r Trp Sar L«o Hit Pre At | Ata Lea I 1 • Lea Tyr Pbe Tyr Ala 
I 3 10 15 

Lea Laa Pbe Lea Sar Sar Tor Cyt Val Ala Tyr Val Ala Tor Arg Aip 

2 0 2 3 3 0 

Asa Cyt Cy i III Laa Atp Ota Arg Pbe Oly Sar Tyr Cyt Pro Thr Tor 
3 5 40 4 5 

Cyi Oly 11a Ala Atp Pbe Laa Sar Tar Tyr Ola Tbr Lyt Val Atp Lyi 

5 0 5 3 6 0 

Atp Laa Ola Sar Lea Ola Atp Me Lea Bit Ola Val Ola Ata Lyt Thr 
63 70 73 SO 

Ser Ola Val Lyt Ola Laa II* Lyt Ala lie Ola Lea Tbr Tyr Ata Pro 
» 5 9 0 9 5 

Atp Ola Ser Ser Lyt Pro Ata Mai 11a Atp Ala Ala Tbr Lea Lyt Ser 

10 0 10 5 110 

Arg lie Met Lea Olo Ola Ila Mel Lyt Tyr Ola Ala Ser lie Lea Tbr 

1 I 3 t20 125 

Hit Atp Sar Ser lie Arg Tyr Laa Ola Ola 11a Tyr Ata Ser Ata Ata 



1 3 0 



1 40 
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-continued 



Oil Lyi lie V»l Ata Lee Lyi Olo Lp Vil All Ola Lee Ola All Ola 

145 150 155 160 

Cyt Ola Ola Pre Cyi Lyi Atp Tbr V*l Ola I l« HI* Asp I It Tar Oi y 
165 170 175 

Lji Atp Cyi Ola Atp 11a Ala Ata Lyi Oly Ala Lyi Ola 8ar Oly Laa 
ISO 1 t 5 190 

Tyr Pba III Lyi Pre Laa Lys Ala Ata Ola Ola Pfaa Laa Val Tyr Cp 

195 200 205 

Olo 11« Atp Oly Sir Oly Ata Oly Trp Tbr Val Pba Ola Lyi Art La a 

2 10 2 15 220 

Atp Oly Sir Vil Atp Phi Ly t Lyt Ata Trp 111 Ola Tyr Lyi Ola Oly 
225 230 255 240 

Pba Oly Hit Lid Sir Pre Tbr Oly Tbr Tbr Ola Pbi Trp Leo Oly An 
245 250 255 

Olo Lyt III Hit Lio III Sir Tbr Ola Sir Ala III Pre Tyr Ala Lie 
260 265 270 

Arg Val Olo Lio Olo Atp Trp Ata Oly Art Tbr Sir Tbr Ala Atp Tyr 
275 210 2 15 



Ala Met Pba Lyt Val Oly Pro Olo Ala Atp Lyt Tyr Arg Lao Tbr Tyr 



2 90 



2 95 



3 00 



Ala Tyr Pba Ala Oly Oly Atp Ala Oly Atp Ala Pba Atp Oly Pba Atp 



3 0 5 



3 1 0 



3 1 5 



Pbc Oly Atp Atp Pre Sir Atp Lyt Pbi Pbi Tbr Sir Hit Ata Oly Mai 



3 2 5 



3 3 5 



Ola Pba Sar Tbr Trp Atp Ata Atp Ata Atp Lyt Pba Ola Oly Ata Cyt 



3 4 0 



3 4 5 



3 5 0 



Ala Olo Ola Atp Oly Sir Oly Trp Trp Mot Ata Lyt Cyt Hit Ala Oly 



3 5 5 



3 65 



Bit Lao Ata Oly Val Tyr Tyr Ola Oly Oly Tbr Tyr Sar Lyt Ala Sar 



3 70 



3 t 0 



Tbr Pro Ata Oly Tyr Atp Ata Oly 
3 8 5 3 9 0 

Arg Trp Tyr 



« lie Trp Ala Tbr Trp Lyt Tbr 
3 9 5 40 0 



>r Mat Lyt Lyt Tbr Tbr Mil Lyt III lli Pro Pbi Ata 



4 0 5 



4 1 0 



4 1 5 



Arg Lio Tbr III Oly Olo Oly Oil Ola Hit Hit Lie Oly Oly Ala Lyt 



Ola Val Arg Pro Ola Bit Pro All Olo Tbr Olo Tyr Atp Sar Lao Tyr 



4 3 5 



4 4 0 



445 



Pro Olo Atp Atp Lie 
4 5 0 



( 2 ) INFORMATION FOR SBQ ID NOtf: 

( i )SBOA)ENCBCHAJUCrEltIST]CS: 

( A ) LENGTH: lOgOTbatipain 
(B )TTPE:aoclaVcaekl 
( C )STJUNDEDNBSS:doofaki 
( D )TOPOLOOT: hamm 

( v i i ) IMMEDIATE SOURCE: 

( B ) CLONE: <mm bat* lamjgluUJa 

( i i ) SEQUENCE DESCRIPTION: SBQ ID NOrt: 



ACOCOTOTCO ACCTOCAOOT 
OTTTTOOTOO CTOTAOCTTT 
CTOTTCTCTC TCAAOATTOT 



CAACOOATCT CTOTOTCTOT 
CAOCTACAOT CTOAAOTCAT 
OTTCTOCTOT TTOOOTCTTT 



TTTCATOTTA OTACCACACT 
AAAOCCTOOT ACCTCCAOCT 
AOTOTCTCCA C AC A ATTTTT 













oU 




AOAATTOTTT 


OTTCTAOTTC 

U * A W A A X* A A L 


TflTOAAAAAT 

A V A V/ A A A A A 


OATOCTflOT A 

VAAVV A w w A A 


TTTTflATA Afl 
I A A A V A A AAU 


OATTrt**ATTO 
UAA 1 U L A A AU 


2 4 0 


AATCTQTiii 


/l/>T i r A rfl A T A 
Uk AALAUAAA 


T A A T f A TTflfl 
1 A U A L A J A U U 


CI T A ft T A ft A ft ^ 
U A AW A A1.AUT 


ft A ft T T T A A ft A 
CACTI 1 A A L A 


A Ta TTA A ft T ft 
AIATTAACTt 


3 0 0 


ITCACATCTO 

A A W A W A 1 V 1 \I 


Tfl Art^ A T A A T 
1 UAvFLAAUAl 


ATATTTTOf*** 
A A A A A A A L L L 


ft ft T ft T AT AT/"* 

1, L A L AATATv 


AT^TTOA ATT 

ATCAACAATT 


f+ f+ T 1 f+ ft T ATA a 

CCTCCTATCA 


3 6 0 


OTTTCTTTCA 

W A A ALA A A L A 


TTOf*AflTTTT 
. A 1 U L A U A 111 


<*TrtArtT A r A 
L 1 U A U A A L A U 


/lXr*TT A^ A ft ft 
V 1 L 1 AALAvC 


iLLl 1 UU 1 1 A 


A AATAATT^*S* 

OAOTCATTCC 


4 2 0 


TCAOTATTTT 

A L A VI A A A A 1 A 


ATTffTTTA A 
A A 1 V, 1. 1 A A 0 A 


T A /"* A A T T rt T 
1 A L A A 1 i U J U 


a AxnAonx a a 

A A 1 UAUUTAA 


TTTTf*TT A AT 

TTTTCTTAOT 


TTCTCTTTCT 


4 8 0 


OAT AOCTCAT 


TOTT AflTflT a 
A U A A A U A W 1 A 


TATATAflA AA 
1A 1 A 1 A U A A A 


AflTA ATAflAT 
AVLAALAWA A 


TTfTAT/lT AT 
1 ALAAAUAAA 


TA ATTTTAT a 
A A A A 1 A A U A A 


5 4 0 


TCCTOCAACA 


AATTTf*TATA 

U A A A 1 L A A 1 U 


tattaatttt 

1A1 A A A A 1 A 1 


VAAALLAULA 


A^TTTA^rtrtA 
A L I 1 1 A L V> VI A 


A T T ft A ft T T A T 

ATTCACTTAT 


6 0 0 


T AOCTI TTTO 

A W W A A A A A 


O T O A f a T <^ T T" 
UAWALAlLA 1 


n a n n a t t t t f 

U A U U A 1 1 1 A L 


TnA A ft A A A AT 
1 WAAwAAAA 1 


r\ fx ft A T ft ft T A T 
U U L A 1 UUA A 1 


A A T A A A A A A A 

OOTAOOACAA 


6 6 0 


OOTOT C ATOT 


PATPTOCA A A 
L A A L A UL A An 


r* a n T n n ft a/it 

L A U 1 UVVAU I 


IAAVLAIlAA 


ft ft ft T T ft ft A A ft 
LV.L I 1 L L A A L 


^TAA A *P*rVAV 

CTOOATTTCT 


7 2 0 


TTO ATTTCTT 


TPTflTrTnin 

A L 1 « 1 v 1 U Au 


TATOArT Aflfi 
1 A L U A L 1 A U U 


ATT<^^*r*A A T A 
AA ALLLAAAA 


f T A T A ft ft ft A A 
C A A 1 A L L U A A 


T AAAA A ^ A A A 

IAAAAOIOOC 


7 t 0 


AAOAOTOOAC 


ATPPTTflTTT 

A A L L A 1 u 1 V 1 


TATTTTTf*Tf? 

AAA 1 I I A L A U 


A L w A 1AUAW 


A A ATrtTTTTr 
AAA I U L 1 1 1 L 


AATTTT*VA A A 

AOTTTTTCAC 


8 4 0 


CATTAATTAT 


AATflTTTACT 

A A A \* A A A A L A 


flTflnnrTTfiT 
u i u u u l a a u a 


ft AT ATOTrtrtO 
L A A A 1 V 1 vUL 


r* T T ft ATTA TA 
L 1 1 L A 1 AAAA 


TAA A AATA 1 ^ A 

TOOAOOTCTA 


9 0 0 


TTCCCTCT AT 


ACCCACCTTO 

A^rLLALLA A \J 


TTflAflA/lTTT 
A 1 U A U A U 1 1 A 


TTATCATAAA 
A A A A LAI AAA 


AflTA TflTT/»A 
A VJ A A A V* A A vr A 


A TTTTAT/»A A 

ATTITOTCAA 


9 6 0 


AAOTT TTT CC 


TflCATCTATT 

AWLAALAAJ 1 


OAflATOATTT 
UAUA1UA1 A A 


TTAfTfTTfA 
A 1 A L A V- A ALA 


ATT/*ATTA at 
AA 1 L A 1 1 A A 1 


AATTTTT ATT 

OATTTTTATT 


10 2 0 


CTTCATTTTQ 

W A A X* A A A A W 


TT A ATflATTT 
1 A A A 1 U A 1 1 I 


ft ft A T T ^" T T A 
L L A A 1 t i i LA 


ATTTflTT A A ft 
AA 1 I \3 I 1 A A L 


n t n n t a ta t^* 


A A A T ^ ^1 A *V* 4* A 

ACATTOATTO 


10 8 0 


ATTTOTOOAT 


ACTTTTflT AT 
A L L A 1 A U A A 1 


ft c t ft n n a t a 

tLl 1 U U U A 1 A 


A A ft ft T ft A ft T T 
A A L L 1 L A L 1 I 


^1 A T ^ A T f\ A A ^ 

OATCATOAOC 


9 9 ^F* ^ A A A A 

TTTCAATOTA 


114 0 


TTT TTOAATT 


L A L A 1 1 U L 1 A 


ATATTf"TATT 
AAA! 1 L A W 1 A 


ft ft ft T ATTTTT 
UUU1AA 1111 


A A AT^T#*YAV 

QCATCTCTAT 


TCATCAATOA 


12 0 0 


T ATTOOCCT A 


AOAAAOOTTT 
AWAAAUUA A A 


TflTTTflnTTT 
1 U 1 V. i UU 1 Al 


T A ft T A T ft A ft ft 
AAU 1 AA LAvrU 


ft t n. a t n ft t *\ r\ 
OTOAAUlTOO 


CCTCATAOAO 


12 6 0 


AO AO T TT AO A 


A V L A J 1 1 vt 1 


/*f*Tr"TTTnAT 

LLIL11IUAI 


T T T T ft ft ft A AT 
111 1LUOAAT 


A f% T T T ft. K f\ T A 

AOTTTOAOTA 


OOATAOOTAT 


13 2 0 


TAACTCTTCT 


T-TAAATOTTT 

A A A A A A U A 1 A 


UUUU AL 1 1 v. v. 


f* T ft ft T ft A ft ft ft 
L 1 \J\J I UAUL I 




AATCCOCCTC 


13 8 0 


AOOOATOTOO 


OTTTflATfff* 
V/ A A 1 U A 1 L L L 


1 KJ \J l t AUUUA 


A^*^*ATXA A T" a 

AllAI TAATA 


A A A ^ ^ ^ A ^4 A 

AOATCCCACA 


TOCTOCAOOC 


14 4 0 


A AC A AOCCCC 


CAAOCTflCAA 

A A \J L A U L A A 


L L A L 1 U AvL 1 


Or*A A ft ft ft ft T ft 
ULAALLUL A U 


ft A AT ft ft ft A 

LAUaULLLAL 


AOOCCACOAC 


15 0 0 


C AO AO AA AOC 


CCAPATAPAf! 

X— W A Vrf A A A V. A U 


caooaa a n a r* 

V.AUUUAAUAL 


ftftAftftAftA Aft 
LLAVJLALAAL 


ft ft ft A A A A A A f\ 

LUUAAAAAUU 


A A ^ W ^1 ^ M +m *m 

AOTTTOOTOO 


15 6 0 


A AT A CAOCTO 


TOAAOCCOTC 


TflflTrPTflOA 

1 w w A L L A U U A 


ftTftftTClftTTtl 
LALLAwLl AU 


aA/iaa atttt 

auuuaaaaat 


TTAAAAATTA 


16 2 0 


T T O A T T C A AT 


TTCATTACTO 

A A L A A A A W A U 


V i AAl. 1 U U 1 L 


TnTTf*AT ATT 
1U1 1 L A 1 A 1 1 


TT^TaTTT^T 

ATCaATTTCT 


TCCOOOTTCA 


16 8 0 


OTCTTOOOAO 


ATTQTACATO 

A A A w A A L A A V 


CCTAOOAATO 

L L A A U U A A A U 


I KJ I L L W 1 1 1L 


T T ft T A ft ft T T ft 
A 1 L 1AUUTTO 


TCCATTTTAT 


17 4 0 


TOO AC ATOCA 


TOOOAOfAfA 

A WWVAWLALA 


CAOfAffOAf 
L A U L A L L U A V. 


fAftftftAftA ft T 
LAULUAUAL A 


L A A U L T U U C T 


TCCTOOOOCC 


110 0 


AOOCTOOOOC 


CCCAAflCAOC 

^ W A A W L A XJ V— 


A A W V L A A L L A 


Afl AflTflTflTn 


A AAA^^AA^T 

AAAULLLALI 


OACCCTOCCC 


18 6 0 


AOCCC C A C A A 


TTTCATTCTO 

A A A L A A 1 L A U 


ATiA AflTflATT 
AUAAUJUA1 1 


r*f*TTrtf*TTr*T 

L L 1 1 U L 1 1L1 


A^a ^*TTa /*aa 
OCACTTACAO 


OCCCAOOATC 


19 2 0 


TOACCTOCTT 


CTOAOOAOf A 

W A W A \M XJ A W V* A 


flfldflTTTTnn 

vJ w U U A 1 1 J U U 


ftAftftAftftftftft 
L AUUALUUvU 


AAA TAA^A AA 


AOCCOACOOO 


19 8 0 


OOT CCAOOTC 


CCCTCCCAOO 


L L L L L L A w 1 V 


T ft ft ft ft ft A ft ft ft 
A UVrV/ULAWLL 


ft T T ft ft ft A A AA 

L A AvUUAAAO 


ATTOCCCCAO 


2 0 4 0 


T CT CCCTCCT 


ACAOTQOTCA 


CTPrriflf Tn 


ft ft ft ft A ft ft ft ft A 
LLLLAUULLA 


ftAftftTftftTTT 

UAUCTOCTTT 


ATTTCCOTCT 


2 10 0 


CTCT CTCTOO 


ATOOTATTCT 


CTOOAAOCTO 


AAflOTTCCTQ 

AAVVA ALL1U 


AAflTTATOA A 
AAU1 AAAUAA 


T A AA^V^AAA 

TAOCTTTOCC 


2 16 0 


CTOAAOOOCA 


TOOTTTQTOO 

A W W AAA \J A W \J 


AWAL\J\lt ALA 


CAOrtAA<"TTf» 
LAUUAAL 1 1 U 


ft ft AA i/*/i/* l Fn 


C AOCTC AO AC 


2 2 2 0 


OT CCCOAOAT 


TOOTOOC ACC 


CAOATTTCCT 


A A \J L A LVL A U 


ft ft ft A A ft A ft ft ft 


A A <■ wmi wmt a <m 

COCTTOTTTC 


2 2 8 0 




ACCTCCCTCC 


TCCCTOCATC 


ACCCAOTTCT 


OA A AOCAOAO 


COOTOCTOOO 


2 3 4 0 


OTCACAOCCT 


CTCOCATCTA 


ACOCCOOTOT 


CC AAACC ACC 


COTOCTOOTO 


TTCOOOOOOC 


2 4 0 0 


TACCTAT00O 


OAAOOOCTTC 


TCACTOCAOT 


OOTOCCCCCC 


OTCCCCTCTO 


AOATCAOAAO 


2 4 6 0 


TCCCAOTCCO 


OACOTCAAAC 


AOOCCOAOCT 


CCCTCC AOAO 


OCTCCAOOOA 


OOOATCCTTO 


2 5 2 0 


CCCCCCCOCT 


OCTOCCTCCA 


OCTCCTOOTO 


CCOCACCCTT 


OAOCCTOATC 


TTOTAOACOC 


2 5 8 0 
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-continued 

CTCAOTCTAO TCTCTOCCTC COTOTTCACA COCCTTCTCC CCATOTCCCC TCCOTOTCCC 2640 

COTTTTCTCT CACAAOOACA CCOOACATTA OATTAOCCCC TOTTCCAOCC TCACCTOAAC 2700 

AOCTCACATC TOTAAAOACC TAOATTCCAA ACAA0ATTCC AACCTOAAOT TCCCOOTOOA 2760 

TOTOAOTTCT OOOOCOACAT CCITCAACCC CATCACAOCT TOCAOTTCAT COCAAAACAT 2 3 30 

OOAACCTOOO OTTTATCOTA AAACCCAOOT TCTTCATOAA ACACTOAOCT TCOAOOCTTO 3110 

TTOCAAOAAT TAAAOOTOCT AATACAGATC AOOOCAAOOA CTOAAOCTOO CTAAOCCTCC 2940 

TCTTTCCATC ACAOOAAAOO OOOOCCTOOO OOCOOCTOOA OOTCTOCTCC COTOAOTOAO 3000 

CTCTTTCCTO CTACAOTCAC CAACAOTCTC TCTOOOAAOO AAACCAOAOO CCAOAOAOCA 3060 

AOCCOOAOCT AOTTTAOOAO ACCCCTOAAC CTCCACCCAA OATOCTOACC AOCCAOCOOO 3 130 

CCCCCTOOAA AOACCCTACA OTTCAOOOOO OAAOAOOOOC TOACCCOCCA OOTCCCTOCT 3 110 

ATCAOOAOAC ATCCCCOCTA TCAOOAOATT CCCCCACCTT OCTCCCOTTC CCCTATCCCA 3240 

ATACOCCCAC CCCACCCCTO TOATOAOCAO TTTAOTCACT TAOAATOTCA ACTOAAOOCT 3300 

TTTOCATCCC CTTTOCCAOA OOCACAAOOC ACCCACAOCC TOCTOOOTAC COACOCCCAT 3360 

OTOOATTCAO CCAOOAOOCC TOTCCTOCAC CCTCCCTOCT COOOCCCCCT CTOTOCTCAO 3420 

CAACACACCC AOCACCAOCA TTCCCOCTOC TCCTOAOOTC TOCAOOCAOC TCOCTOTAOC 3410 

CTOAOCOOTO TOOAOOOAAO TOTCCTOOOA OATTTAAAAT OTOAOAOOCO OOAOOTOOOA 3340 

OOTTOOOCCC TOTOOOCCTO CCCATCCCAC OTOCCTOCAT TAOCCCCAOT OCTOCTCAOC 3600 

COTOCCCCCO CCOCAOOOOT CAOOTCACTT TCCCOTCCTO OOOTTATTAT OACTCTTOTC 3660 

ATTOCCATTO CCATTTTTOC TACCCTAACT OOOCAOCAOO TOCTTOCAOA OCCCTCOATA 3720 

CCOACCAOOT CCTCCCTCOO AOCTCOACCT OAACCCCATO TCACCCTTOC CCCAOCCTOC 37t0 

AOAOOOTOOO TOACTOCAOA OATCCCTTCA CCCAAOOCCA COOTCACATO OTTTOOAOOA 3 140 

OCTOGTOCCC AAOOCAOAOO CCACCCTCCA OOACACACCT OTCCCCAOTO CTOOCTCTOA 3900 

CCTOTCCTTO TCTAAOAOOC TOACCCCOOA AOTOTTCCTO OCACTOOCAO CCAOCCTOOA 3960 

CCCAOAOTCC AOACACCCAC CTOTOCCCCC OCTTCTOOOO TCTACCAOOA ACCOTCTAOO 4020 

CCCAOAOOOO ACTTCCTOCT TOOCCTTOOA TOOAAOAAOO CCTCCTATTO TCCTCOTAOA 4010 

OOAAOCCACC CCOOOOCCTO AOOATOAOCC AAOTOOOATT CCOOOAACCO COTOOCTOOO 4 140 

OOCCCAOCCC OOOCTOOCTO OCCTOCATOC CTCCTOTATA AOOCCCCAAO CCTOCTOTCT 4200 

CAOCCCTCCA CTCCCTOCAO AOCTCAOAAO CACOACCCCA OOOATATCCC TOCAOCCATO 4260 

AAOTOCCTCC TOCTTOCCCT OOOCCTOOCC CTCOCCTOTO OCOTCCAOOC CATCATCOTC 4320 

ACCCAOACCA TOAAAOOCCT OOACATCCAO AAOOTTCOAO OOTTOOCCOO OTOOOTOAOT 4330 

TOCAOOOCOO OCAOOOOAOC TOOOCCTCAO AOAOCCAAOA OAOOCTOTOA COTTOOOTTC 4440 

CCATCAOTCA GCTAOOOCCA CCTOACAAAT CCCCOCTOOO OCAOCTTCAA CCAOOCOTTC 4300 

ACTOTCTTOC ATTCTOOAOO CTOOAAOCCC AAOATCCAOO TOTTOOCAOO OCTOOCTTCT 4360 

CCTOCOOCCO CTCTCTOOOO AOCAOACOOC COTCTTCTCC AOTCCTCTOC OCOCCCTOAT 4620 

TTCCTCTTCC TOTOAOOCCA CCAOOCCTOC TOOAAACACO CCTOCCTOCO CAOCTTCACA 4630 

COACCTTTOT CATCTCTTTA AAOOCCATOT CTCCAOAOTC ATOTOTTOAA OTTCTOOOOO 4740 

TTAOTOOOAC ACAOTTCAOC CCCTAAAAOA OTCTCTCTOC CCCTCAAATT TTCCCCACCT 4300 

CCAOCCATOT CTCCCCAAOA TCCAAATOTT OCTACATOTO OOOOOOCTCA TCTOOOTCCC 4160 

TCTTTOOOTT CAOTOTOAOT CTOOOOAOAO CATTCCCCAO OOTOCAOAOT TOOOOOOAOT 4920 

ATCTCAOOOC TOCCCAOOCC OOOOTOOOAC AOAOAOCCCA CTOTOOOOCT OOOOOCCCCT 4930 
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TCCCACCCCC AOAOTOCAAC TCAAOOTCCC TCTCCAOOTO OCOOOOACTT OOCACTCCTT 5040 

OOCTATOOCO OCCAOCOACA TCTCCCTOCT OOATOCCCAO AOTOCCCCCC TOAOAOTOTA 3 100 

COTOOAOOAO CTOAAOCCCA CCCCCOAOOO CAACCTOOAO ATCCTOCTOC AOAAATOOTO 3 160 

OOCOTCTCTC CCCAACATOO AACCCCCACT CCCCAOOOCT OTOOACCCCC COOOOOOTOO 3 220 

OOTOCAOOAO OOACCAOOOC CCCAOOOCTO OOOAAOAOOO CTCAOAOTTT ACTOOTACCC 5 2 8 0 

OOCOCTCCAC CCAAOOCTOC CCACCCAOOO CTTTTTTTTT TTTTAAACTT TTATTAATTT 5340 

OATOCTTCAO AACATCATCA AACAAATOAA CATAAAACAT TCATTTTTOT TTACTTOOAA 3400 

OOOOAOATAA AATCCTCTGA AGTOGAAATO CATAOCAAAO ATACATACAA TOAOOCAOOT 5 46 0 

ATTCTOAATT CCCTOTTAOT CTOAOOATTA CAAOTOTATT TOAOCAACAO AOAOACATTT 3 3 20 

TCATCATTTC TAOTCTOAAC ACCTCAOTAT CTAAAATOAA CAAOAAOTCC TOOAAACOAA 5310 

OCAOTOTOOO OATAOOCCCO TOTOAAOOCT OCTOOOAGOC AOCAOACCTO OOTCTTCOOG 5640 

CTCAAOCAGT TCCCOCTACC AOCCCTGTCC ACCTCAOACO OOOOTCAOOO TOCAOOAOAO 5700 

AOCTOOATOO OTOTOOOGOC AOAOATOOOO ACCTGAACCC CAOOOCTOCC TTTTOOOOOT 5760 

OCCTOTOOTC AAOGCTCTCC CTOACCTTTT CTCTCTOOCT TCATCTOACT TCTCCTOOCC 3820 

CATCCACCCO OTCCCCTOTO OCCTOAOOTO ACAGTOAOTO COCCOAOOCT AOTTOOCCAO 5180 

CTOOCTCCTA TOCCCATOCC ACCCCCCTCC AOCCCTCCTO OOCCAOCTTC TOCCCCTOOC 5940 

CCTCAOTTCA TCCTGATOAA AATOOTCCAT OCCAATOOCT CAOAAAOCAO CTOTCTTTCA 6000 

GOOAOAACOO COAOTOTOCT CAOAAOAAGA TTATTOCAOA AAAAACCAAO ATCCCTOCOO 6060 

TOTTCAAOAT COATOOTOAO TCCOOOTCCC TOOGOOACAC CCACCACCCC COCCCCCOOO 6 120 

OACTOTOOAC aoottcaooo ooctoocotc OOOCCCTOOG ATOCTAAOOO ACTOOTOOTO 6 180 

ATOAAGACAC TOCCTTOACA CCTOCTTCAC TTOCCTCCCC TOCCACCTOC CCOOOOCCTT 6240 

OOOOCOOTOO CCATOOGCAO OTCCCOOCTO OCOOOCTAAC CCACCAOOOT OACACCCOAO 6300 

CTCTCTTTOC TOOOOGOCOO OCOOTOCTCT OOOCCCTCAO OCTOAOCTCA OOAOGTACCT 6360 

OTOCCCTCCC AOOOOTAACC OAOAOCCOTT OCCCACTCCA OOOOCCCAOO TOCCCCACOA 6420 

CCCCAOCCCO CTCCACAOCT CCTTCATCTC CTOOAOACAA ACTCTOTCCO CCCTCOCTCA 6480 

TTCACTTOTT COTCCTAAAT CCOAOATOAT AAAOCTTCOA OOOOOOOTTO OOOTTCCATC 6540 

AOOOCTOCCC TTCCOCCOOO CAOCCTOOOC CACATCTOCC CTTOOCCCCC TCAOOACTCA 6600 

CTCTOACTOO AOOCCCTOCA CTOACTOACO CCAOOOTOCC CAOCCCAOOO TCTCTOOCOC 6660 

CATCCAOCT.O CACTOOOTTT OOOTOCTOOT CCTOCCCCCA AOCTOCCCOO ACACCACAOO 6 7 2 0 

CAOCCOOOOC TOCCCACTOO CCTCOOTCAO OOTOAOCCCC AOCTOCCCCC OCTCAOOOCT 6780 

TOCCCCOACA ATOACCCCAT CCTCAOOACO CACCCCCCTT CCCTTOCTOO OCAOTOTCCA 6840 

OCCCCACCCO AOATCOOOOO AAOCCCTATT TCTTOACAAC TCCAOTCCCT OOOOOAOOOO 6900 

OCCTCAOACT OAOTOOTOAO TOTTCCCAAO TCCAOOAOOT OOTOOAOOOT CCTOOCOOAT 6960 

CCAOAOTTOA CAOTOAOOOC TTCCTOOOCC CCATOCOCCT OOCAOTOOCA OCAOOOAAOA 7020 

OOAAOCACCA TTTCAOOOOT OOOOOATOCC AOAOOCOCTC CCCACCCCOT CTTCOCCOOO 7080 

TOOTOACCCC OOOOOAOCCC COCTOOTCOT OOAOOOTOCT OOOOOCTOAC TAOCAACCCC 7140 

TCCCCCCCCO TTOOAACTCA CTTTTCTCCC OTCTTOACCO COTCCAOCCT TOAATOAOAA 7200 

CAAAOTCCTT OTOCTOOACA CCOACTACAA AAAOTACCTO CTCTTCTOCA TOOAAAACAO 7260 

TOCTOAOCCC OAOCAAAOCC TOOCCTOCCA OTOCCTOOOT OOOTOCCAAC CCTOOCTOCC 7320 

CAOOOAOACC AOCTOCOTOO TCCTTOCTOC AACAOOOOOT OOOOOOTOOO AOCTTOATCC 73 80 
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CCAOOAOOAO OAOOOOTOOO OOOTCCCTOA OTCCCOCCAO OAOAOAOTOO TCOCATACCO 7440 

OOAOCCAOTC TOCTOTOOOC CTOTOOOTOO CTOOOOACOO OOOCCAOACA CACAOOCCOO 73 00 

OAOACOOOTO OOCTOCAOAA CTOTOACTOO TOTOACCOTC OCOATOOOOC COOTOOTCAC 7360 

TOAATCTAAC AOCCTTTOTT ACCOOOOAOT TTCAATTATT TCCC AAAAT A AOAACTCAOO 7620 

TACAAAOCCA TCTTTC AACT ATCACATCCT OAAAACAAAT OOCAOOTOAC ATTTTCTOTO 76*0 

CCOTAOCAOT CCCACTOOOC ATTTTCAOOO CCCCTOTOCC AOOOOOOCOC OOOCATCOOC 7740 

OAOTOOAOOC TCCTOOCTOT OTCAOCCOOC CCAOOOOOAO OAAOOOACCC OOACAOCCAO 7100 

AOOTOOOOOO CAOOCTTTCC CCCTOTOACC TOCAOACCCA CTOCACTOCC CTOOOAOOAA 7S60 

OOOAOOOOAA CTAOOCCAAO OOOOAAOOOC AOOTOCTCTO OAOOOCAAOO OCAOACCTOC 7920 

AOACCACCCT OOOOAOCAOO OACTOACCCC COTCCCIOCC CCATAOTCAO OACCCCOOAO 79»0 

OTOOACAACO AOOCCCTOOA OAAATTCOAC AAAOCCCTCA AOOCCCTOCC CATOCACATC $040 

COOCTTOCCT TCAACCCOAC CCAOCTOOAO OOTOAOCACC CAOOCCCCOC CCTTCCCCAO $100 

OOCAOOAOCC ACCCOOCCCC OOOACOACCT CCTCCCATOO TOACCCCCAO CTCCCCAOOC »160 

CTCCCAOOAO OAAOOOOTOO OOTOCAOCAC CCCOTOOOOO CCCCCTCCCC ACCCCCTOCC S220 

AOOCCTCTCT TCCCOAOOTO TCCAOTCCCA TCCTOACCCC CCCATOACTC TCCCTCCCCC *2«0 

ACAOOOCAOT OCCACOTCTA OOTOAOCCCC TOCCOOTOCC TCTOOOOTAA OCTOCCTOCC §340 

CTOCCCCACO TCCTOOOCAC ACACATOOOO TAOOOOOTCT TOOTOOOOCC TOOOACCCCA MOO 

CATCAOOCCC TOOOOTCCCC CCTOTOAOAA TOOCTOOAAO CTOOOOTCCC TCCTOOCOAC i460 

TOCAOAOCTO OCTOOCCOCO TOCCACTCTT OTOOOTOACC TOTOTCCTOO CCTCACACAC 1520 

TOACCTCCTC CAOCTCCTTC CAOCAOAOCT AAOOCTAAOT OAOCCAOAAT OOTACCTAAO »3«0 

OOOAOOCTAO COOTCCTTCT CCCOAOOAOO OOCTOTCCTO OAACCACCAO CCATOOAOAO S640 

OCTOOCAAOO OTCTOOCAOO TOCCCCAOOA ATCACAOOOO OOCCCCATOT CCATTTCAOO 1700 

OCCCOOOAOC CTTOOACTCC TCTOOOOACA OACOACOTCA CCACCOCCCC CCCCCCATCA 8760 

OOOOOACTAC AAOOOACCAO OACTOCAOTC ACCCTTCCTO OOACCCAOOC CCCTCCAOOC 1120 

CCCTCCTOOO OCTCCTOCTC TOOOCAOCTT CTCCTTCACC AATAAAOOCA TAAACCTOTO 1110 

CTCTCCCTTC TOAOTCTTTO CTOOACOACO OOCAOOOOOT OOAOAAOTOO TOOOOAOOOA 1940 

OTCTOOCTCA OAOOATOACA OCOOOOCTOO OATCCAOOOC OTCTOCATCA CAOTCTTOTO 9000 

ACAACTOOOO OCCCACACAC ATCACTOCOO CTCTTTOAAA CTTTCAOOAA CCAOOOAOOO 9060 

ACTCOOCAOA OACATCTOCC AOTTCACTTO OAOTOTTCAO TCAACACCCA AACTCOACAA 9120 

AOOACAOAAA OTOOAAAATO OCTOTCTCTT AOTCT AATAA ATATTOATAT OAAACTCAAO 9110 

TTOCTCATOO ATCAAT ATOC CTTTATOATC CAOCCAOCCA CTACTOTCOT ATCAACTCAT 9240 

OTACCCAAAC OCACTOATCT OTCTOOCTAA TOATOAOAOA TTCCCAOTAO AOAOCTOOCA 9300 

AOAOOTCACA OTOAOAACTO TCTOCACACA CAOCAOAOTC CACCAOTCAT CCTAAOOAOA 9360 

TCAOTCCTOO TOTTCATTOO AOOACTOATO TTOAAOCTOA AACTCCAATO CTTTOOCCAC 9420 

CTO ATOTOA A OAOCTOACTC ATTTOAAAAO ACCCTOATOC TOOOAAAOAT TOAOOOCAOO 9410 

AOOAOAAOOO OACOACAOAO OATOAQATOO TTOOATOOCA TCACCAACAC AATOOACATO 9340 

OOTTTOOOTO OACTCCAOOA OTTOOTOATO OACAOOOAOO CCTOOCOTOC TACOOAAOCO 9600 

OTTTATOOOO TCACAAAOAC TOAOTOACTO AACTOAOCTO AACTOAATOO AAATOAOOTA 9660 

TACAOCAAAG TOOOOATTTT TTAOATAATA AOAATATACA CATAACATAO TOTATACTCA 9720 

TATTTTTATO CATACCTOAA TOCTCAOTCA CTCAOTCOTA TCTOACTCTO TOACCTATOO 9710 
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ACCOTAOCCT TCCAOOTTTC TTCTOTCCAC AOAATTCTCC AAOOCAAOAA TACTOOAOTO 9*40 

OOTAOCCATT TCCTCCTCCA OOOOATCCTC CCOACCCAOO OATTOAACCO OCATCTCCTO 9900 

TATTOOCAOO TOO ATTC TT T ACCACTOTOC CACCAOOOAA OCCCOTOTTA CTCTCTATOT 9960 

CCCACTTAAT TACCAAAOCT OCTCCAAOAA AAAOCCCCTO TOCCCTCTOA OCTTCCCOOC 100 20 

CTOCAOAOOO TOOTOOOOOT AOACTOTOAC CTOOOAACAC CCTCCCOCTT CAOOACTCCC 1O0I0 

OOOCCACOTO ACCCACAOTC CTOCAOACAO CCOOOTAOCT CTOCTCTTCA AOOCTCATTA 10140 

TCTTTAAAAA AAACTOAOOT CTATTTTOTO ACTTCOCTOC COTAACTTCT OAACATCCAO 10200 

TOCOATOOAC AOOACCTCCT CCCCAOOCCT CAOOOOCTTC AOOOAOCCAO CCTTCACCTA 10260 

TOAOTCACCA OACACTCOOO OQTOOCCCCO CCTTCAOOOT OCTCACAOTC TTCCCATCOT 103 20 

CCTOATCAAA OAOCAAOACC AATOACTTCT TAOOAOCAAO CAOACACCCA CAOOACACTO 103S0 

AOOTTCACCA OAOCTOAOCT OTCCTTTTOA ACCTAAAOAC ACACAOCTCT COAAOOTTTT 10440 

CTCTTTAATC TOOATTTAAO OCCTACTTOC CCCTCAAOAO OOAAOACAOT CCTOCATOTC 10500 

CCCAOOACAO CCACTCOOTO OCATCCOAOO CCACTTAOTA TTATCTOACC OCACCCTOOA 10S60 

ATT AATCOOT CCAAACTOOA CAAAAACCTT OOTOOOAAOT TTCATCCCAO AOOCCTCAAC 10620 

CATCCTOCTT TOACCACCCT OCATCTTTTT TTCTTTT ATO TOTATOCATO TATATATATA 10610 

TATATATTTT TTTTTTTTTC ATTTTTTOOC TOTOCTOOCT OTTCOTTOCA OTTCOOTOCO 10740 

CAOOCTTCTC TCTAOTTTCT CTCTAOTCTT CTCTTATCAC AOAOCAOTCT CTAOACOATC 10*00 

OACOCOT 10*07 



( 2 ) INFORMATION FOR SEQ ID NO*: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 47 base pn 
( B ) TYPE: wckic aod 
( C ) STRANDEDNESS: «gk 
( D )TOPOLOOY:bae« 

( i i ) SEQUENCE DESCRIPTION: SBQ ID NO* 

AATTCCOATC OACOCOTCOA COATATACTC TAOACOATCO ACOCOTA 

( 2 ) INFORMATION FOR SBQ ID N0^9: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 24 baa* 
( B ) TYPE: onclbc mad 
( C ) STRANDEDNESS: tmjfe 
( D ) TOPOLOGY: hmcm 

(»u ) IMMEDIATE SOURCE: 

( B ) CLONE: BLOAMP3 

( x i > SEQUENCE DESCRIPTION: SBQ ID NO* 

TOOATCCCCT OCCOOTOCCT CTOO 

( 2 ) INFORMATION FOR SEQ ID NO: 10: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 24 base poi 
( B ) TYPE: aKkic acid 
( C ) STRANDEDNESS: «m«k 
( D )TC*OLOOY:l»e* 

( * i i ) IMMEDIATE SOURCE: 

( B ) CLONE: BLOAMP4 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
AACOCOTCAT CCTCTOTOAO CCAG 



2 4 
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( 2 ) INFORMATION FOR SEQ ID NO*.U: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 10 fante pan 
( B ) TYPE: mdac mad 
( C ) STRANDEDNESS: angle 
( D ) TOPOLOGY: fine* 

(Tii ) IMMEDIATE SOURCE: 

( B ) CLONE: 206839 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: II: 

ACTACOTAOT 10 

( 2 ) INFORMATION FOR SEQ ID NO:12: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 42 btse pexn 
( B ) TYPE: noclbc add 
( C ) STRANDEDNESS: angle 
( D ) TOPOLOGY: bnenr 

(vii ) IMMEDIATE SOURCE: 

< B ) CLONE: 208632 

( z i ) SEQUENCE DESCRIPTION: SEQ ID NO:12: 

COACOCOOAT CCTACOTACC TOCAOCCATO TTTTCCATOA OG 42 



( 2 ) INFORMATION FOR SEQ ID NO: 13: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 21 owe pan 
( B ) TYPE: nockic acid 
( C ) STRANDEDNESS: tingle 
( D ) TOPOLOGY: fine* 

(Tii ) IMMEDIATE SOURCE: 

( B ) CLONE: 206627 

( i i ) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

AOOOCTTCOO CAAOCTTCAG O 21 



( 2 ) INFORMATION FOR SEQ ID NO: 14. 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 24 hwe pan 
( B ) TYPE: nockic acid 
( C ) STRANDEDNESS: ongk 
( D )TOPOLOOY:bncar 

(Tii ) IMMEDIATE SOURCE: 

( B ) CLONE: 206521 

(si) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

OCCAAAOACT TACTTCCCTC T AO A 2* 



( 2 ) INFORMATION FOR SEQ ID NO: 13: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 30 bate pan 
( B ) TYPE: nockic nod 
( C ) STRANDEDNESS: ugk 
( D ) TOPOLOGY: finenr 

(Tii ) IMMEDIATE SOURCE: 

( B ) CLONE: ZC6320 

SEQUENCE DESCRIPTION: SEQ ID NO. 13: 



OCATOAACOT COCOTOOTOO TTOTOCTACC 



# 
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( 2 ) INFORMATION TOR SBQ ID NOU6: 

( i ) SBQUENCB CHARACTERISTICS: 
( A ) LENGTH: 30 haac pn 
( B JTYTOaodricarid 
( C ) STRANDEDNESS: smjk 
( D ) TOPOLOOY: baear 

( v i i ) IMMEDIATE SOURCE: 

(B ) CLONE 206519 

< x i ) SEQUENCE DESCRIPTION: SBQ ID NOl6: 

ACCACOCOAC OTTCATOCTC TAAAACCOTT 30 



( 2 ) INFORMATION FOR SEQ ID NO-.17: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 36 bate pain 
( B ) TYPE: anckic acid 
( C ) STRANDEDNESS: angle 
( D ) TOPOLOOY: linear 

(vii) IMMEDIATE SOURCE: 

( B ) CLONE; Z06S1S 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

OCTOCOOOAT CCTACOTACT AOOOOGACAO OOAAOO 36 

( 2 ) INFORMATION FOR SBQ ID NO-.I8: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENOTB: 45 faaae pan 
( B ) TYPE: snclek add 
( C ) STRANDEDNESS: angle 
( D )TOPOUX3Y:facar 

( v I i ) IMMEDIATE SOURCE: 

( B ) CLONE: ZC6629 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:18: 

COACOCOAAT TCTACOTACC TOCAOCCATO AAAAOOATOO TTTCT 4 3 

( 2 ) INFORMATION FOR SEQ ID NO: 19: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 43 baas pan 
( B ) TYPE: anckk acid 
( C ) STRANDEDNESS: nagk 
( D ) TOPOLOOY: haear 

(Mi ) IMMEDIATE SOURCE: 

( B ) CLONE: ZO6630 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO-.19: 

COACOCOAAT TCTACOTACC TOCAOCCATO AAACATCTAT TATTO 45 

( 2 ) INFORMATION FOR SBQ ID NO20: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 21 bate pain 
( B ) TYPE: anckic aod 
( C ) STRANDEDNESS: ungk 
( D ) TOPOLOOY: haear 

(Tit ) IMMEDIATE SOURCE: 

( B ) CLONE: ZC6623 

( x t ) SEQUENCE DESCRIPTION: SEQ ID NO30: 



OTOAOATTTT CAOATCTTOT C 
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( 2 )rNP0RMATICNP0RSBQIDN021: 

( a ) SEQUENCE CHARACTERISTiCS: 
( A ) LENGTH: 21 faaat pain 
( B ) TYPE: sockac add 
( C ) STRANDEDNESS: da«W 
( D ) TOPOLOGY: baear 

( f i i ) IMMEDIAtE SOURCE: 

(B ) CLONE: 206626 

( x i ) SEQUENCE DESOIPTMN: SBQ ID NC*31: 

AACAATTACT OTOOCCTACC A 21 



( 2 ) INFORMATION FOR SBQ ID N022: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 33 bam pairs 
( B ) TYPE: aockk add 
( C ) STRANDEDNESS: naafe 
( D ) TOPOLOGY: hmm 

(Tii ) IMMEDIATE SOURCE: 
f?| ( B ) CLONE: Z06624 

=Q ( x i ) SEQUENCE DESCRIPTION: SEQ ID N02Z: 

m 

OCTOCOOAAT TCTACOTACT ATTOCTOTOO OA A 

i £ £ 

1^ ( 2 ) INFORMATION FOR SBQ ID N023: 

S ,|T ( i ) SEQUENCE CSARACTERISTJCS: 

( A ) LENGTH: 45 bam pan 
i ^ ( B ) TYPE: aickfc add 

( C ) STRANDEDNESS: M»k 

( D )TOP0UXJY:6acar 

^ ( v i i ) IMMEDIATE SOURCE: 

M: ( B ) CLONE: Z06514 

H ( x i )SB0^ENCBDESC3UPTK3N:SBQn>NO2S: 

y j COACOCOOAT CCTACOTACC TOCAOCCATO AOTTOOTCCT TOCAC 



( 2 ) INFORMATION FOR SBQ ID N034: 

( i )SBQUENCB CHARACTERISTICS: 
( A ) LENGTH: 21 taw pan 
( B ) TYPE: Kicktc add 
( C ) STRANDEDNESS: 
( D ) TOPOLOGY: hmm 

(Mi) IMMEDIATE SOURCE: 

( B ) CLONE: ao6517 

( x i ) SEQUENCE DESCRIPTION: SBQ ID N034: 

OTCTCTOOTA OCAACATACT A 



( 2 ) INFORMATION FOR SEQ ID N025: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 bmm pan 
( B ) TYPE: aoddc add 
( C ) STRANDEDNESS: dmals 
( D ) TOPOLOGY: fiaear 

) IMMEDIATE SOURCE: 

( B ) CLONE: x£5l6 

( x i ) SEQUENCE DESCRIPTION: SBQ ID N023: 



OOOTTTCTAO CCCTACTAOT AO 



22 
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( 2 ) INFORMATION FOR SEQ ID N036: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 32 baae pan 
( B ) TYPE: nckac acid 
( C ) STRANDEDNESS: oarfe 
( D ) TOPOLOGY: liaear 

( t i i ) IMMEDIATE SOURCE: 

( B ) CLONE: »6515 

( z i ) SEQUENCE DESCRIPTION: SEQ ID N026: 

OOOTTTCTAO CCCTACTAOT AO 22 

( 2 ) INFORMATION FOR SEQ ID N027: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 47 baae pan 
( B ) TYPE: aackic acid 
( C ) STRANDEDNESS: male 
( D ) TOPOLOGY: fiaear 

( x i ) SEQUENCE DESCRIPTION: SEQ ID N037: 

AAOCTACOCG TCOATCOTCT AOAOTATATC OTCOACOCOT COATCOO 47 



